Methods for increasing the production of a recombinant polypeptide from a 
host cell 

Field of the Invention 

The present invention relates to methods for enhancing the production of a polypeptide
from a cell by disrupting the synthesis or activity of a pitrilysin metalloprotease. In
particular, the present invention relates to methods for enhancing the secretion of
recombinant polypeptides from host cells such as, but not limited to, yeast and
bacterial cells.

Background of the Invention 

Cholecystokinin (CCK) is a vertebrate neuroendocrine peptide hormone that is
expressed in both gut and brain tissues. The maturation of bioactive CCK peptides
depends on post-translational tyrosine sulfation, endoproteolytic cleavages,
exoproteolytic trimmings and carboxyterminal amidation. The endoproteolytic
processing of the N-terminus varies, with CCK-83, -58, -39, -33, -22, -8 and -5 being
identified. Most of the CCK peptides are synthesized after cleavage at a single Arg
residue; however, CCK-22 requires processing after a single Lys residue.

Many recombinant polypeptides have been expressed in yeast as a fusion protein to the
Saccharomyces cerevisiae α-factor prepro-peptide to direct secretion through the
secretory pathway. The best characterized yeast protease is the serine endoprotease
Kex2p (Fuller et al., 1989), which is involved in maturation of the α-mating pheromone
and of killer toxin (Julius et al., 1984). Another yeast protease is Yps1p, belonging to
the yapsin family of glycosyl-phosphatidylinositol (GPI)-anchored aspartyl proteases,
which is able to rescue mating deficiency when overexpressed in a kex2 mutant
(Egel-Mitani et al., 1990). Expression of foreign proteins has shown that Yps1p and
Yps2p contain endoprotease activity.

The use of host cells for the expression of recombinant polypeptides has greatly
simplified the production of large quantities of commercially valuable polypeptides,
which otherwise are obtainable only by purification from their native sources. There is a
varied selection of expression systems currently available from which to choose for the
production of any given polypeptide, including eubacterial and eucaryotic hosts. One
important factor in the selection of an appropriate expression system is the ability of
the host cell to produce adequate yields of the polypeptide. However, a problem
frequently encountered is the high level of proteolytic enzymes produced by a given
host cell or in the culture medium. Accordingly, there is a need for further methods
which enhance the production of a recombinant polypeptide from a host cell.

Metalloproteases are the most diverse of the four main types of protease, with more
than 30 families identified to date. In these enzymes, a divalent cation, usually zinc,
activates the water molecule. The metal ion is held in place by amino acid ligands,
usually three in number. The known metal ligands are His, Glu, Asp or Lys, and at least
one other residue is required for catalysis, which may play an electrophilic role. Of the
known metalloproteases, around half contain an HEXXH motif, which has been shown in
crystallographic studies to form part of the metal-binding site.

A number of proteases dependent on divalent cations for their activity have been
shown to belong to a single family, peptidase M16. Included are insulinase,
mitochondrial processing protease, pitrilysin, nardilysin and a number of bacterial
proteins. These proteins do not share many regions of sequence similarity; the most
noticeable is in the N-terminal section. This region includes a conserved histidine
followed, two residues later, by a glutamate and another histidine. In pitrilysin, it has
been shown that this H-X-X-E-H motif is involved in enzymatic activity; the two
histidines bind zinc and the glutamate is necessary for catalytic activity. The X can be
any amino acid. Inactive members of this family have lost from one to three of these
active site residues.
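
The H-X-X-E-H motif described above can be located in a protein sequence with a simple pattern search. The following is a minimal sketch only: the function name and the toy sequence are hypothetical and not taken from the sequence listing, and one-letter amino acid codes are assumed.

```python
import re

# H, any two residues, E, H. The lookahead lets overlapping occurrences be reported.
HXXEH = re.compile(r"(?=(H[A-Z]{2}EH))")


def find_hxxeh(sequence):
    """Return (1-based start position, matched motif) for each H-X-X-E-H occurrence."""
    seq = sequence.upper()
    return [(m.start() + 1, m.group(1)) for m in HXXEH.finditer(seq)]


if __name__ == "__main__":
    # Hypothetical toy sequence, not a real protein.
    print(find_hxxeh("MKTLHFLEHMAGGS"))
    # An HEXXH-type site (H-E-X-X-H) is not matched by this pattern.
    print(find_hxxeh("AAHEAAHAA"))
```

Note that the pattern is distinct from the HEXXH motif: the glutamate must sit immediately before the second histidine, so a search of this kind would separate the two motif types.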

It has previously been suggested, e.g. in US 5,861,280, that host cells and methods of
producing proteins could be provided by genetically modifying a filamentous fungal host
cell so that it expresses significantly reduced levels of a metalloprotease containing an
HEXXH motif. However, a genetic modification that significantly reduces the levels of a
metalloprotease containing a motif other than HEXXH, in a non-filamentous fungal host
cell or in other cells, has never been described.

Summary of the Invention

Whilst investigating the role various proteases play in processing proCCK in
recombinant yeast, the present inventors surprisingly noted that the deletion/disruption
of CYM1 enhanced recombinant polypeptide production and secretion. Furthermore, the
present inventors have found that Cym1p belongs to a family of metalloproteases, the
activity of which can be down-regulated to enhance the levels of recombinant
polypeptide produced from a host cell.

Accordingly, in a first aspect the present invention provides a host cell comprising a
nucleic acid sequence encoding a recombinant polypeptide in which the production of a
naturally occurring metalloprotease comprising a sequence provided in SEQ ID NO:1
has been reduced or inhibited by genetic manipulation.

The host cell can be any cell which, in its native state, possesses the metalloprotease.
Accordingly, the host cell can be a eukaryotic or prokaryotic cell. Examples of preferred
eukaryotic cells include, but are not limited to, mammalian cells, plant cells and fungal
cells. In a preferred embodiment, the host cell is a yeast cell. More preferably, the
yeast cell is selected from, but not limited to, the group consisting of: Saccharomyces
sp. such as Saccharomyces cerevisiae and Saccharomyces kluyveri,
Schizosaccharomyces sp. such as Schizosaccharomyces pombe, Kluyveromyces lactis,
Hansenula polymorpha, Pichia pastoris, Pichia methanolica, Pichia kluyveri, Yarrowia
lipolytica, Candida utilis, Candida cacaoi, and Geotrichum fermentans.

The metalloprotease is a member of the pitrilysin subfamily of proteases, characterized
by comprising the sequence HXXEH (SEQ ID NO:1) where X is any amino acid. In a
preferred embodiment, the metalloprotease comprises a consensus sequence provided
in SEQ ID NO:2. Even more preferably, the metalloprotease comprises a consensus
sequence provided in SEQ ID NO:3. In addition, it is preferred that the metalloprotease
comprises SEQ ID NO:1 and a glutamic acid residue between 70 and 80 amino acids
C-terminal of the second His residue. Further, it is preferred that the metalloprotease
comprises a sequence selected from the group consisting of SEQ ID NO:1, SEQ ID
NO:2 and SEQ ID NO:3 as well as a sequence selected from the group of:

i) any one of SEQ ID NO's 4 to 15, and

ii) a sequence which is at least 80% identical to any one of SEQ ID NO's 4 to 15.

More preferably, the metalloprotease comprises SEQ ID NO:1 or SEQ ID NO:2 or SEQ
ID NO:3 as well as a sequence selected from the group of:

i) any one of SEQ ID NO's 4 or 5, and

ii) a sequence which is at least 80% identical to any one of SEQ ID NO's 4 or 5.

Preferably, the metalloprotease comprises a sequence which is at least 85% identical,
such as at least 90% identical, such as at least 95% identical, or such as at least 99%
identical, to any one of SEQ ID NO's 4 to 15.
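
The identity thresholds above presuppose some way of scoring two aligned sequences. As an illustrative sketch only, percent identity can be computed from a pairwise alignment as shown below. This passage does not specify an alignment program or a denominator, so counting matches over all aligned columns (gap columns included) is an assumption made here, and the function name and example strings are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity between two pre-aligned sequences of equal length.

    Gaps are written as '-'. Columns where both sequences have a gap are ignored;
    every other column counts towards the denominator (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Hypothetical aligned fragments, not taken from the sequence listing.
    identity = percent_identity("HFLEHMA-G", "HYLEHMAKG")
    print(f"{identity:.1f}% identical; meets 80% threshold: {identity >= 80.0}")
```

Other conventions, such as dividing by the length of the shorter sequence, give different values for the same alignment, so the convention should be fixed before a threshold such as 80% is applied.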

In a particularly preferred embodiment, the metalloprotease comprises a sequence as
provided in SEQ ID NO:4, or a sequence at least 80% identical, such as at least 90%,
such as at least 95% and such as 99% identical, thereto.

The host cell can be genetically manipulated by any means known in the art as long as
the production of the metalloprotease is reduced or inhibited when compared to a
parental host cell which has not been genetically manipulated. Such means of
genetically manipulating the host cell include, but are not limited to: gene knockout,
gene disruption, random or site-directed mutagenesis, introduction of
dominant-negative metalloproteases, RNA interference (RNAi) using dsRNA, catalytic
nucleic acids (such as ribozymes and DNAzymes), and antisense nucleic acids. Preferably,
the genetic manipulation acts directly upon the gene encoding the metalloprotease, the
mRNA transcribed from the gene, or produces a protein that alters the activity of the
metalloprotease, such as a dominant-negative mutant which competes with the
metalloprotease for binding to a substrate but does not, for example, possess catalytic
activity. However, the host cell may be genetically manipulated such that it indirectly
affects the production or activity of the metalloprotease. For instance, the genetic
manipulation can target a transcription factor involved in transcribing the mRNA
encoded by the metalloprotease gene, thus at least reducing the levels of
metalloprotease produced by the manipulated host cell.

Furthermore, the host cell may be further genetically manipulated such that it lacks at
least one other naturally occurring protease of the host cell or has reduced activity for
at least one other naturally occurring protease of the host cell. The protease can be any
enzyme of which the inhibition increases the production of a recombinant polypeptide
produced by the host cell. The protease can be an endopeptidase, an
aminopeptidase or a carboxypeptidase. Preferred proteases include serine proteases,
aspartyl proteases, cysteine proteases and other metalloproteases.

In one embodiment, the host cell is a yeast cell and the other naturally occurring
protease(s) is at least one protease selected from the group consisting of: KEX2, YPS1
(previously known as YAP3), YPS2 (previously known as MKC7), YPS3, YPS6, YPS7,
BAR1, STE13, KEX1, PRC1, PEP4 (also known as PRA1), APE1, APE2, APE3 and CPS1.
Preferably, the host cell is a yeast cell and KEX2 production has been disrupted.

The recombinant polypeptide can be any desired polypeptide which is capable of being
produced in the host cell. The recombinant polypeptide can comprise a naturally
occurring sequence or have been produced by the intervention of man (e.g. a mutant
or truncation of a naturally occurring protein, or a fusion between at least two different
polypeptides). Typically, the recombinant polypeptide will be of commercial value, for
example in the treatment of diseases.

The recombinant polypeptide can be any size. Typically, the recombinant polypeptide
will range in size from about 30 amino acids to about 4,500 amino acids. In one
embodiment, the recombinant polypeptide is between about 30 and about 200 amino
acids in length.

In at least some host cell expression systems for producing recombinant polypeptides,
it is desirable to direct the recombinant polypeptide to be secreted from the host cell.
Thus, in a preferred embodiment the nucleic acid comprises a sequence which encodes
a signal for directing the recombinant polypeptide to be secreted from the host cell.
Preferably, the signal is an N-terminal hydrophobic signal sequence. Such N-terminal
hydrophobic signal sequences are known in the art, and include, for example but not
limited to, the leader sequence originating from the fungal amyloglucosidase (AG) gene
such as glaA (both 18 and 24 amino acid versions), e.g. from Aspergillus sp., the
α-factor gene of yeasts, e.g. from Saccharomyces sp. and Kluyveromyces sp., the
P-factor of Schizosaccharomyces sp., and the α-amylase gene from Bacillus sp. In one
embodiment, the recombinant polypeptide is expressed as a fusion of an N-terminal
hydrophobic signal sequence and a second polypeptide sequence encoding the
recombinant polypeptide which is from a different source than the signal sequence.

The nucleic acid encoding the recombinant polypeptide can be provided to the host cell
using any technique known in the art. In one embodiment, the nucleic acid is inserted
into the genome of the host cell using, for example, homologous recombination based
techniques. In another embodiment, the nucleic acid is transfected or transformed into
the host cell in an expression vector which remains extrachromosomal. For example,
the expression vector can be a plasmid or a virus. Further, it is preferred that the
vector comprises a selectable marker which can be used to selectively propagate host
cells comprising the vector. Such selectable markers and the use thereof are also
known in the art.

In a second aspect, the present invention provides a method of producing a
recombinant polypeptide, the method comprising culturing a host cell according to the
first aspect under suitable conditions such that the recombinant polypeptide is
produced, and recovering the recombinant polypeptide.

Since the proteolytic action arising from the metalloprotease has been reduced or
inhibited, the method of the second aspect of the invention improves the stability of the
recombinant polypeptide produced by the host cell.

In a preferred embodiment, the recombinant polypeptide is secreted from the host cell.
Furthermore, it is preferred that the secreted protein is recovered during exponential
growth of a culture comprising the host cell.

Preferably, the quantity of the recovered recombinant polypeptide is higher than if a
parental host cell was used. More preferably, the quantity of the recovered recombinant
polypeptide is at least 50% higher than if a parental host cell was used.

In a third aspect, the present invention provides a method of cleaving a polypeptide at
a basic residue, the method comprising contacting the polypeptide, in the presence of a
divalent cation, with a metalloprotease comprising a sequence selected from the group
of:

i) any one of SEQ ID NO's 4 to 15, and

ii) a sequence which is at least 80% identical to any one of SEQ ID NO's 4 to 15.

In a particularly preferred embodiment, the metalloprotease comprises a sequence as
provided in SEQ ID NO:4, or a sequence at least 80% identical, such as at least 90%,
such as at least 95% and such as 99% identical, thereto.

Preferably, the metalloprotease cleaves the polypeptide at the C-terminal side of an
amino acid, or sequence of amino acids, selected from the group consisting of: Lys,
Arg, ArgArg, LysLys, ArgLys and LysArg. Accordingly, it is preferred that the
polypeptide comprises Lys, Arg, ArgArg, LysLys, ArgLys or LysArg. Other sequence
requirements may also be necessary for cleavage; however, these can readily be
determined by routine experimentation.
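
The listed mono- and dibasic sites can be enumerated for a given substrate as a first step before experimental testing. The sketch below is illustrative only: it assumes one-letter amino acid codes, treats two consecutive basic residues as a single dibasic site, and does not model any additional sequence requirements, which as noted above would have to be determined experimentally. The function name and the example sequence are hypothetical.

```python
BASIC = {"K": "Lys", "R": "Arg"}


def candidate_cleavage_sites(sequence):
    """Return (position, site) pairs for mono- and dibasic sites.

    position is the 1-based index of the residue after which cleavage
    (C-terminal to the site) would occur.
    """
    seq = sequence.upper()
    sites = []
    i = 0
    while i < len(seq):
        if seq[i] in BASIC:
            # Two consecutive basic residues are reported once, as a dibasic site.
            if i + 1 < len(seq) and seq[i + 1] in BASIC:
                sites.append((i + 2, BASIC[seq[i]] + BASIC[seq[i + 1]]))
                i += 2
                continue
            sites.append((i + 1, BASIC[seq[i]]))
        i += 1
    return sites


if __name__ == "__main__":
    # Hypothetical toy substrate, not a CCK sequence.
    for position, site in candidate_cleavage_sites("AAKRGGRAAK"):
        print(f"{site}: cleavage after residue {position}")
```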

Preferably, the divalent cation is selected from the group consisting of: Zn2+, Co2+ and
Mn2+.

The method of the third aspect can be performed in vivo, within a recombinant host cell
producing the metalloprotease, or in vitro in suitable reaction conditions. Considering
the present disclosure, the skilled addressee could readily perform the method of the
third aspect. An example of an in vitro system for cleaving a polypeptide with the
defined metalloprotease is provided herein. In this instance, the polypeptide is
contacted with the metalloprotease, provided as a crude yeast cell extract, in 0.1 M
NaH2PO4 (pH 4.5) and in the presence of 1 mM Mn2+ and 1 mM bestatin. In another
example, the metalloprotease can be recombinantly produced as a fusion protein with a
suitable "tag", such as a His-tag, which enables easy purification of the fusion protein.
Preferably, such a "tag" is removed (for example by enzymatic cleavage) before the
metalloprotease is exposed to the substrate polypeptide.

In a fourth aspect, the present invention provides a method of identifying an agent that
inhibits the activity of a metalloprotease comprising a sequence provided in SEQ ID
NO:1, the method comprising the steps of:

(a) incubating the metalloprotease with the agent, in the presence of a divalent
cation and a suitable substrate;

(b) determining the activity of the metalloprotease on the substrate;

(c) comparing the activity obtained in step (b) with the activity of a control
sample that has not been incubated with the agent; and

(d) selecting an agent that inhibits the activity of the metalloprotease.

The substrate can be any polypeptide that can be cleaved by the metalloprotease and
the cleavage event detected. One example disclosed herein is the use of CCK as a
substrate, where the cleavage event is detected by the production of CCK-22. Similar
assays can readily be developed for other substrates.
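
Steps (b) to (d) of the screening method amount to comparing each agent's measured activity with that of the untreated control and keeping the agents that reduce it. A minimal sketch of that comparison follows. The 50% selection threshold, the agent names and the activity values are hypothetical and chosen purely for illustration; the method itself does not prescribe a cut-off.

```python
def percent_inhibition(activity_with_agent, control_activity):
    """Percent reduction in activity relative to the control without agent."""
    if control_activity <= 0:
        raise ValueError("control activity must be positive")
    return 100.0 * (1.0 - activity_with_agent / control_activity)


def select_inhibitors(activities, control_activity, threshold=50.0):
    """Return the agents whose percent inhibition meets the assumed threshold."""
    return sorted(
        agent
        for agent, activity in activities.items()
        if percent_inhibition(activity, control_activity) >= threshold
    )


if __name__ == "__main__":
    # Hypothetical activities, e.g. the fraction of substrate converted to CCK-22.
    measured = {"agent_A": 0.10, "agent_B": 0.38, "agent_C": 0.40}
    control = 0.40
    for agent, activity in sorted(measured.items()):
        print(agent, round(percent_inhibition(activity, control), 1))
    print("selected:", select_inhibitors(measured, control))
```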

In a preferred embodiment of the fourth aspect, the metalloprotease comprises a
sequence as provided in SEQ ID NO:4, or a sequence at least 80% identical, such as at
least 90%, such as at least 95% and such as 99% identical, thereto.

In a fifth aspect, the present invention provides a method of producing a recombinant
polypeptide, the method comprising culturing a host cell comprising a nucleic acid
sequence encoding a recombinant polypeptide under suitable conditions such that the
recombinant polypeptide is produced, and recovering the recombinant polypeptide,
wherein said culturing comprises the presence of an inhibitor of a metalloprotease
comprising a sequence provided in SEQ ID NO:1.

Preferably, the inhibitor is identified according to a method of the fourth aspect.

As will be apparent, preferred features and characteristics of one aspect of the
invention may be applicable to other aspects of the invention.



Throughout this specification the word "comprise", or variations such as "comprises" or
"comprising", will be understood to imply the inclusion of a stated element, integer or
step, or group of elements, integers or steps, but not the exclusion of any other
element, integer or step, or group of elements, integers or steps.

The invention will hereinafter be described by way of the following non-limiting Figures
and Examples.

Brief Description of the Accompanying Drawings 

Figure 1. Cholecystokinin expression construct. PreproMfα1p-proCCK fusion protein
with the amino acid sequences around the fusion site and of the primary cleavage sites
shown. The major forms of secreted CCK with their N- and C-terminal amino acid
residues are shown below.

Figure 2. CCK-22 maturation in cells and media as a function of cell growth. BJ2168
expressing preproMfα1p-proCCK fusion protein. The CCK-22 immunoreactivity was
measured by RIA using Ab 89009 and total CCK content measured with Ab 89009 after
tryptic cleavage. Open circles represent the fraction of secreted CCK-22, whereas the
intracellular fraction of CCK-22 is presented as filled triangles. The cell growth was
measured by OD600 (open squares). The data represent the mean of two independent
experiments.

Figure 3. Chromatographic analyses of normal and K->A mutated CCK secreted from
BJ2168. Media from yeast transformed with pRS426 preproMfα1p-proCCK and pRS426
preproMfα1p-proCCK (K->A) were subjected to G-50 gel chromatography and the CCK
immunoreactivity was measured with Ab 7270, specific for Gly-extended CCK (A and C),
and Ab 89009, which is specific for the N-terminus of CCK-22 (B and D).

Figure 4. In vitro protease assay including inhibitors and activators. The fraction of
CCK-22 was calculated from the immunoreactivity using Ab 89009 divided by the total
amount of mature and N-terminal extended CCK-22 measured with Ab 89009 after
trypsin treatment. A, Effect of different inhibitors. B, Protease reactivation by addition
of 1.2 mM divalent metal ions to extracts where the activity had been inhibited with 1
mM EDTA. The data represent mean ± SD of three independent experiments.
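
The CCK-22 fraction used in this and the following figures is a simple ratio: the Ab 89009 immunoreactivity measured directly, divided by the Ab 89009 immunoreactivity measured after trypsin treatment. A minimal sketch of that calculation, with the mean ± SD summary over replicates, is given below. The function names and the replicate values are hypothetical, and the sample standard deviation is an assumption, since the legend does not state which form of SD was used.

```python
from statistics import mean, stdev


def cck22_fraction(before_trypsin, after_trypsin):
    """Mature CCK-22 immunoreactivity divided by the total measured after trypsin."""
    if after_trypsin <= 0:
        raise ValueError("total immunoreactivity after trypsin must be positive")
    return before_trypsin / after_trypsin


def summarize(replicates):
    """Return (mean, sample SD) of the CCK-22 fraction over independent replicates."""
    fractions = [cck22_fraction(before, after) for before, after in replicates]
    return mean(fractions), stdev(fractions)


if __name__ == "__main__":
    # Hypothetical (before, after) immunoreactivity pairs in arbitrary units.
    replicates = [(2.0, 10.0), (3.0, 10.0), (4.0, 10.0)]
    average, sd = summarize(replicates)
    print(f"CCK-22 fraction: {average:.2f} ± {sd:.2f} (n={len(replicates)})")
```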

Figure 5. Protease reactivation by Zn2+ and Mn2+. In vitro protease assays performed
with cell extracts from UY123, where the activity was inhibited with 1 mM EDTA (filled
squares) and reactivated by addition of 1.2 mM Mn2+ (open circles) or 1.2 mM Zn2+
(filled circles). The activity was measured as the fraction of matured CCK-22 after 30,
60 and 120 min incubation. The fraction of CCK-22 was calculated from the
immunoreactivity using Ab 89009 divided by the total amount of mature and
N-terminal extended CCK-22 measured with Ab 89009 after trypsin treatment. The data
are represented by mean ± SD (n=3).

Figure 6. Extracellular CCK-22 maturation by members of the yapsin family. The
ability of intact cells to process extracellular CCK was analysed as described under
"Experimental Procedures" for BY4705 and the isogenic yps1, yps1 yps3 and yps1 yps2
yps3 strains. The fraction of CCK-22 was calculated from the immunoreactivity using Ab
89009 divided by the total amount of CCK measured with Ab 89009 after trypsin
treatment. The data are represented by mean ± SD (n=4). Statistics were performed
using unpaired t test as described in experimental procedures (*** = P<0.001, ** =
P<0.01 and * = P<0.05).

Figure 7. Increased proteolysis following Cym1p overexpression. In vitro protease
assays performed with cell extracts from BJ2168 transformed with an empty pRS425
plasmid (A) and with pRS425 containing CYM1 (B). The CCK-22 immunoreactivity was
measured over time using Ab 89009 (filled squares and circles) and the total amount of
mature and N-terminal extended CCK-22 measured with Ab 89009 after trypsin
treatment (open squares and circles). The data represent mean ± SD of three
independent experiments.

Figure 8. Effects of KEX2 and CYM1 deletions on proCCK secretion and CCK-22
maturation. Yeast cells transformed with the proCCK expression construct were
harvested during exponential phase and the media collected. The intracellular (A) and
extracellular (B) amount of total CCK was measured with Ab 89009 after trypsin
treatment. The fraction of intracellular (C) and secreted (D) CCK-22 was calculated as
the immunoreactivity measured with Ab 89009 before tryptic cleavage divided by the
total amount of CCK measured in (A) and (B). The kex2, cym1 and kex2 cym1 strains
are isogenic to BJ2168. The data are given as mean ± SD (n=4). Statistics were
performed using unpaired t test (*** = P<0.001, ** = P<0.01 and * = P<0.05). The
stars enclosed in brackets are comparisons between the kex2 and kex2 cym1 strains.

Figure 9. Intracellular degradation of CCK depends on Cym1p cleavage to CCK-22.
Expression of wild type CCK, preproMfα1p-proCCK, and the CCK mutant,
preproMfα1p-proCCK (K->A), in BJ2168 and a CYM1 disrupted strain isogenic to BJ2168.
The cells were sedimented during exponential growth and the total amount of CCK
(hatched bars) was measured after trypsin and carboxypeptidase B treatment with Ab 7270
specific for Gly-extended CCK. The amount of mature Gly-extended CCK (white bars), which
is dependent on translocation into the secretory pathway, Kex2p and carboxypeptidase
activity, is measured as the immunoreactivity using Ab 7270 before tryptic cleavage and
carboxypeptidase B treatment. The data are given as mean ± SD (n=3).

Figure 10. Aspartyl proteases involved in the maturation of CCK-22. Expression of wild
type proCCK in BY4705 and the isogenic yapsin deletion strains of YPS1, YPS2 and
YPS3. The intra- (A) and extracellular (B) fraction of synthesised CCK-22 was measured
during exponential growth. The fraction of mature CCK-22 was calculated as the
immunoreactivity measured with Ab 89009 before tryptic cleavage divided by the
total amount of CCK. The data are represented by mean ± SD (n=3). Statistics were
performed using unpaired t test as described in experimental procedures (*** =
P<0.001, ** = P<0.01 and * = P<0.05).

Figure 11. Cym1p processing C-terminally to both Lys and Arg residues. CYM1
deletion enhances the amount of secreted CCK more than two-fold for both wild type
CCK and the Lys61 -> Arg61 mutant. Expression of wild type CCK, preproMfα1p-proCCK,
and the CCK mutant, preproMfα1p-proCCK (Lys61 -> Arg61), in BJ2168 and a CYM1
disruptant isogenic to BJ2168. Yeast cells were harvested during exponential phase and
the media collected. The intracellular (A) and extracellular (B) amount of total CCK was
measured with Ab 89009 after trypsin treatment. The data are given as mean ± SD
(n=3).

Figure 12. Secreted proCCK fragments identified by mass spectrometry. The CCK
numbers refer to C-terminal amidated CCK. The molecular masses are given as
monoisotopic values except for those marked *, which denote average values. Strain A,
vacuolar protease-deficient strain (BJ2168), and B, the isogenic strain with KEX1 KEX2
disruptions (UY22).

Figure 13. Model for the production of the C-terminally extended CCK (A) and GLP2
(B). Expression of these fusion peptides should be performed in a sec61 mutant, or the
pre-sequence of the α-mating factor should be removed to avoid translocation into the
ER. The amino acid sequences around the fusion sites are shown. Underlined are the N-
and C-terminal amino acids of the Gly-extended CCK-22 and GLP1.

Key to the Sequence Listing 

SEQ ID NO:1 - Consensus sequence for pitrilysin proteases.
SEQ ID NO:2 - Consensus sequence for at least some pitrilysin proteases.
SEQ ID NO:3 - Consensus sequence for at least some pitrilysin proteases.
SEQ ID NO:4 - Saccharomyces cerevisiae Cym1p (Swissprot Accession No. P32898).
SEQ ID NO:5 - Schizosaccharomyces pombe CI 19.7 (Swissprot Accession No. O42908).
SEQ ID NO:6 - Clostridium perfringens HypA protein (Swissprot Accession No. Q46205).
SEQ ID NO:7 - Borrelia burgdorferi protein BB0228 (Swissprot Accession No. O51246).
SEQ ID NO:8 - Caenorhabditis elegans C05D11.1 protein (Swissprot Accession No. P48053).
SEQ ID NO:9 - E. coli protease III (Swissprot Accession No. P05458).
SEQ ID NO:10 - Human NRD convertase (Swissprot Accession No. P47245).
SEQ ID NO:11 - Human insulysin (Swissprot Accession No. P14735).
SEQ ID NO:12 - Arabidopsis thaliana CPE (GenBank Accession No. T03302).
SEQ ID NO:13 - Human metalloprotease I (GenBank Accession No. AAH01150).
SEQ ID NO:14 - Bacillus subtilis zinc protease ymxG (GenBank Accession No. Q04805).
SEQ ID NO:15 - Mycobacterium tuberculosis zinc protease Rv2782c (GenBank Accession No. O33324).
SEQ ID NO's 16 to 42 - Oligonucleotides.
SEQ ID NO's 43 to 52 - Sequences provided in Figure 12.
SEQ ID NO's 53 to 55 - Sequences provided in Figure 1.

Detailed Description of the Invention

The present invention provides a host cell useful for the expression of a polypeptide,
said cell being genetically manipulated in order to produce at least reduced levels of a
defined metalloprotease when compared to the parental cell. The host cell will thus be
able to express a protein of interest in higher quantity because the proteolytic action of
the metalloprotease has been reduced or inhibited, which improves the stability of the
protein of interest.

By the method of the invention, the proteolytic action of the metalloprotease has been 
reduced or inhibited, thereby improving the stability of the product obtained. 

One embodiment of the present invention relates to a host cell useful for the
expression of a protein of interest, wherein said cell has been genetically modified in
order to express significantly reduced levels of a metalloprotease which is at least 80%
identical to SEQ ID NO:4, as compared to a parental cell.

In the present context, the term "protein of interest" relates to any of the numerous
naturally occurring, extremely complex substances, such as but not limited to
proteins, enzymes and/or antibodies, that consist of amino acid residues joined by
peptide bonds. It is an object of preferred embodiments of the present invention to
provide such native proteins which are products of the host cell itself and/or
heterologous proteins, fusion proteins, recombinant proteins, eucaryotic proteins,
prokaryotic proteins, lysosomal proteins, vacuolar proteins, precursor proteins,
zymogen proteins, prepro-proteins, and secreted proteins.

In the present context, the term "host cell" relates to any cell capable of producing the
protein of interest. Thus in one preferred embodiment, the host cell is a prokaryotic cell.
In another preferred embodiment, the host cell is a eukaryotic cell, such as but not
limited to a filamentous fungal cell and a non-filamentous fungal cell. Non-limiting
examples hereof are strains of Saccharomyces, especially Saccharomyces cerevisiae.

All the features described below relating to the method of the present invention are
also applicable as embodiments relating to the host cell.

The method described in the present application relates to the production of a protein
of interest in a host cell, wherein said host cell has been genetically modified in order to
express significantly reduced levels of a metalloprotease which is at least 80% identical
to SEQ ID NO:4 as compared to a parental cell, when cultured under identical
conditions, comprising

a) introducing into the host cell a nucleic acid sequence encoding the protein of
interest;

b) cultivating the host cell of step (a) in a suitable growth medium for
production of the protein of interest; and

c) isolating the protein of interest.

One embodiment of the present invention relates to a method for the production of a
protein of interest in a host cell, wherein the host cell has been genetically modified by
a method selected from the group comprising gene knock-out, gene disruption, random
or site-directed mutagenesis, introduction of dominant-negative metalloproteases, RNA
interference (RNAi) using dsRNA, catalytic nucleic acids (such as ribozymes and
DNAzymes), antisense nucleic acids or a combination thereof.

In a most preferred embodiment the host cell is essentially free of any metalloprotease
activity.

One preferred embodiment of the present invention relates to a method for the
production of a protein of interest in a host cell, in which the protein of interest is a
eucaryotic protein, selected from the group comprising insulin, growth hormone,
glucagon, somatostatin, interferon, adrenocorticotropic hormones, angiotensinogen,
atrial natriuretic peptides, dynorphin, endorphins, galanin, gastrin, gastrin releasing
peptides, neuropeptide Y fragments, pancreastatin, pancreatic polypeptides, secretin,
vasoactive intestinal peptide, growth hormone releasing factor, melanocyte stimulating
hormone, neurotensin, adrenal peptide, parathyroid hormone and related peptides,
somatostatin and related peptides, brain natriuretic peptide, calcitonin, corticotropin
releasing factor (CRF), cocaine amphetamine regulated transcript (CART), thymosin,
urotensin, glucagon and glucagon like peptides (GLP-1 and GLP-2), somatostatin,
interferon, a vascular endothelial growth factor (VEGF), platelet derived growth factor
(PDGF), factor VII, factor VIII, factor V, factor IX, interleukins, urokinase,
erythropoietin (EPO), chymosin, tissue plasminogen activator, CCK or serum albumin.

10 

Another preferred embodiment of the present invention relates to a method for the
production of a protein of interest in a host cell, in which the protein of interest is a
protein of fungal origin, selected from the group comprising an amylolytic enzyme, an
alpha-amylase, a beta-amylase, a glucoamylase, an alpha-galactosidase, a cellulolytic
enzyme, a lipolytic enzyme, a xylanolytic enzyme, a proteolytic enzyme, an
oxidoreductase, a peroxidase, a laccase, a pectinase, or a cutinase.

A further preferred embodiment of the present invention relates to a method for the
production of a protein of interest in a host cell, in which the protein of interest is a
bacterial protein, selected from the group comprising an amylolytic enzyme, an
alpha-amylase, a beta-amylase, a glucoamylase, a beta-galactosidase, a cellulolytic enzyme, a
lipolytic enzyme, a xylanolytic enzyme, a proteolytic enzyme, an oxidoreductase, a
peroxidase, a laccase, a pectinase, or a cutinase.

A special embodiment of the present invention relates to a method for production of a
protein of interest in a host cell, in which the protein of interest is a precursor, i.e. a
zymogen, a hybrid protein, a protein obtained as a pro sequence or pre-pro sequence,
or in unmatured form.

General Molecular Biology

Unless otherwise indicated, the recombinant DNA techniques utilised in the present
invention are standard procedures, well known to those skilled in the art. Such
techniques are described and explained throughout the literature in sources such as J.
Perbal, A Practical Guide to Molecular Cloning, John Wiley and Sons (1984); J.
Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor
Laboratory Press (1989); T.A. Brown (editor), Essential Molecular Biology: A Practical
Approach, Volumes 1 and 2, IRL Press (1991); D.M. Glover and B.D. Hames (editors),
DNA Cloning: A Practical Approach, Volumes 1-4, IRL Press (1995 and 1996); F.M.
Ausubel et al. (editors), Current Protocols in Molecular Biology, Greene Pub. Associates
and Wiley-Interscience (1988, including all updates until present); Methods in
Enzymology, Vol. 194, Guide to Yeast Genetics and Molecular Biology (1991), Ed.
Guthrie and Fink, Academic Press; Methods in Microbiology, Vol. 26, Yeast Gene
Analysis (1998), Ed. Brown and Tuite, Academic Press; Miller, J. H. (1992) A Short
Course in Bacterial Genetics (Manual, L., ed), Cold Spring Harbor Laboratory, Cold
Spring Harbor, NY; Johnston, J. R. (1994) Molecular Genetics of Yeast (A Practical
Approach), Oxford University Press, Oxford; and Molecular Genetics of Yeast: A
Practical Approach, Ed. J.R. Johnston, IRL Press (1994), all of which are incorporated
herein by reference.

The % identity of a polypeptide is determined by GAP (Needleman and Wunsch, 1970)
analysis (GCG program) with a gap creation penalty=5, and a gap extension
penalty=0.3. The query sequence is at least 15 amino acids in length, and the GAP
analysis aligns the two sequences over a region of at least 15 amino acids. More
preferably, the query sequence is at least 50 amino acids in length, and the GAP
analysis aligns the two sequences over a region of at least 50 amino acids. Even more
preferably, the query sequence is at least 100 amino acids in length and the GAP
analysis aligns the two sequences over a region of at least 100 amino acids. More
preferably, the query sequence is at least 250 amino acids in length and the GAP
analysis aligns the two sequences over a region of at least 250 amino acids. Even more
preferably, the query sequence is at least 500 amino acids in length and the GAP
analysis aligns the two sequences over a region of at least 500 amino acids.
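
As an illustration only, the idea of deriving a % identity from a global (Needleman and Wunsch) alignment can be sketched as follows. This is not the GCG GAP program: the sketch assumes a simple match/mismatch score and a single linear gap penalty instead of the gap creation and gap extension penalties stated above, and it assumes that identity is counted over the full alignment length. The function names and scoring values are hypothetical.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Global alignment with a linear gap penalty (simplified, not GCG GAP).

    Returns the two aligned strings, with '-' marking gap positions.
    """
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if (i > 0 and j > 0 and a[i - 1] == b[j - 1]) else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Identical aligned positions as a percentage of the alignment length."""
    aligned_a, aligned_b = needleman_wunsch(a, b)
    identical = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * identical / len(aligned_a)
```

For real comparisons an established alignment tool with a substitution matrix and affine gap penalties would be used; the sketch only shows where the percentage comes from.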

Pitrilysin Subfamily of Metalloproteases

The pitrilysin subfamily of metalloproteases is characterized by the presence of an
HXXEH (SEQ ID NO:1) motif. A general review of this subfamily is provided by Rawlings
and Barrett (1995). Members of this family include, but are not limited to, S. cerevisiae
Cym1p (SEQ ID NO:4) (Swissprot Accession No. P32898), Schizosaccharomyces pombe
C119.7 (SEQ ID NO:5) (Swissprot Accession No. O42908), Clostridium perfringens
HypA protein (SEQ ID NO:6) (Swissprot Accession No. Q46205), Borrelia burgdorferi
protein BB0228 (SEQ ID NO:7) (Swissprot Accession No. O51246), Caenorhabditis
elegans C05D11.1 protein (SEQ ID NO:8) (Swissprot Accession No. P48053), E. coli
protease III (SEQ ID NO:9) (Swissprot Accession No. P05458), human NRD convertase
(SEQ ID NO:10) (Swissprot Accession No. P47245), human insulysin (SEQ ID NO:11)
(Swissprot Accession No. P14735), Arabidopsis thaliana CPE (SEQ ID NO:12) (GenBank
Accession No. T03302), human metalloprotease I (SEQ ID NO:13) (GenBank Accession
No. AAH01150), Bacillus subtilis zinc protease ymxG (SEQ ID NO:14) (GenBank
Accession No. Q04805), and Mycobacterium tuberculosis zinc protease Rv2782c (SEQ
ID NO:15) (GenBank Accession No. O33324). For E. coli protease III (SEQ ID NO:9) it
has been shown that the His residues of SEQ ID NO:1, as well as Glu-169, are involved
in divalent cation binding whilst the Glu residue flanked by the His residues is a
catalytic residue.
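
A minimal sketch of how a protein sequence could be scanned for the HXXEH motif is given below. The function name is hypothetical, and the motif alone is only a first filter: family membership would still need to be confirmed by sequence comparison.

```python
import re

# H, any two residues, then E and H; the lookahead lets overlapping hits be reported.
HXXEH = re.compile(r"(?=(H..EH))")


def find_hxxeh(protein_sequence):
    """Return (1-based position, matched residues) for every HXXEH occurrence."""
    sequence = protein_sequence.upper()
    return [(m.start() + 1, m.group(1)) for m in HXXEH.finditer(sequence)]
```

For example, the made-up fragment `MKHFLEHAA` contains one occurrence, `HFLEH`, starting at residue 3.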

A gene encoding a pitrilysin metalloprotease can readily be identified by screening by
hybridization for nucleic acid sequences coding for all of, or part of, the
metalloprotease, e.g. by using synthetic oligonucleotide probes that may be prepared
on the basis of a cDNA sequence, e.g. the nucleotide sequences encoding any one of
the metalloproteases presented as SEQ ID NOs: 4 to 15, in accordance with standard
techniques.

Genetic Manipulations

The host cell of the invention, which is genetically manipulated in order to produce
reduced levels of the defined metalloprotease, may be modified using standard
recombinant DNA technology known to the person skilled in the art. The gene sequence
responsible for the production of the metalloprotease may be inactivated or eliminated
entirely.

In a particular embodiment, the host cell of the invention is one genetically
manipulated at the coding or regulatory regions of the metalloprotease gene. Known
and useful techniques include, but are not limited to, gene knockout, gene disruption,
random or site-directed mutagenesis, introduction of dominant-negative
metalloproteases, RNA interference (RNAi) using dsRNA, catalytic nucleic acids (such as
ribozymes and DNAzymes), and antisense nucleic acids, or a combination thereof.

Mutagenesis may be performed using a suitable physical or chemical mutagenizing
agent. Examples of physical or chemical mutagenizing agents suitable for the present
purpose include ultraviolet (UV) irradiation, hydroxylamine,
N-methyl-N'-nitro-N-nitrosoguanidine (MNNG), O-methyl hydroxylamine, nitrous acid, ethyl methane
sulphonate (EMS), sodium bisulfite, formic acid, and nucleotide analogues. When such
agents are used, the mutagenesis is typically performed by incubating the cell to be
mutagenized in the presence of the mutagenizing agent of choice under suitable
conditions for the mutagenesis to take place, and selecting for mutated cells having a
significantly reduced production of metalloprotease.

Genetic manipulation may also be accomplished by the introduction, substitution or
removal of one or more nucleotides in the metalloprotease coding sequence or a
regulatory element required for the transcription or translation thereof. Nucleotides
may, for example, be inserted or removed so as to result in the introduction of a stop
codon, the removal of the start codon or a change of the open reading frame. The
modification or inactivation of the structural sequence or a regulatory element may be
accomplished by site-directed mutagenesis or PCR-generated mutagenesis in
accordance with methods known in the art.
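
The effect of such nucleotide changes on the encoded product can be illustrated with a short sketch. It assumes the standard genetic code and a toy coding sequence that is not taken from any metalloprotease gene; the helper names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate from the first base and stop at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def substitute(dna, index, base):
    """Replace the base at a 0-based index."""
    return dna[:index] + base + dna[index + 1:]


def insert(dna, index, bases):
    """Insert bases before a 0-based index."""
    return dna[:index] + bases + dna[index:]


wild_type = "ATGAAATGGGAACATTAA"          # toy ORF encoding M-K-W-E-H
nonsense = substitute(wild_type, 8, "A")  # TGG (Trp) becomes TGA (stop)
frameshift = insert(wild_type, 3, "A")    # one extra base shifts the reading frame
```

Here `translate(wild_type)` gives `MKWEH`, the substitution truncates the product to `MK`, and the single-base insertion yields the unrelated sequence `MKMGTL`.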

A convenient way to inactivate or reduce the metalloprotease production of a host cell
is based on the principles of gene interruption. This method involves the use of a DNA
sequence corresponding to the endogenous gene or gene fragment which it is desired
to destroy. The DNA sequence is mutated in vitro to a defective gene and transformed
into the host cell. By homologous recombination, the defective gene replaces the
endogenous gene or gene fragment. It may be desirable that the defective gene or
gene fragment encodes a marker which may be used for selection of transformants in
which the gene encoding the metalloprotease has been modified or destroyed.

The term "antisense" as used herein refers to nucleotide sequences which are
complementary to a specific nucleic acid sequence. Antisense molecules may be
produced by any method, including synthesis by ligating the gene(s) of interest in a
reverse orientation to a viral promoter which permits the synthesis of a complementary
strand. Once introduced into a host cell, this transcribed strand combines with natural
sequences produced by the cell, in this instance those encoding the metalloprotease, to
form duplexes. These duplexes then block either further transcription or translation.
In this manner, mutant phenotypes may be generated.
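
The complementarity that underlies an antisense molecule can be sketched in a few lines. The helper names and the input fragment are illustrative assumptions, and only the unambiguous bases A, C, G and T are handled.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Reverse complement of a DNA strand written 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def antisense_rna(sense_dna):
    """RNA (5' to 3') that base-pairs with the mRNA corresponding to sense_dna."""
    return reverse_complement(sense_dna).replace("T", "U")
```

For the illustrative fragment `ATGAGATTTCCT`, the reverse complement is `AGGAAATCTCAT` and the corresponding antisense RNA is `AGGAAAUCUCAU`.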

The term "catalytic nucleic acid" refers to a DNA molecule or DNA-containing molecule
(also known in the art as a "deoxyribozyme") or an RNA or RNA-containing molecule
(also known as a "ribozyme") which specifically recognizes a distinct substrate and
catalyzes the chemical modification of this substrate. The nucleic acid bases in the
catalytic nucleic acid can be bases A, C, G, T and U, as well as derivatives thereof.
Derivatives of these bases are well known in the art.

Typically, the catalytic nucleic acid contains an antisense sequence for specific
recognition of a target nucleic acid, and a nucleic acid cleaving enzymatic activity, also
referred to herein as the "catalytic domain". The types of ribozymes that are
particularly useful in this invention are the hammerhead ribozyme (Haseloff and
Gerlach, 1988) and the hairpin ribozyme (Shippy et al., 1999).

Ribozymes useful for the methods of the invention, and DNA encoding the ribozymes,
can be chemically synthesized using methods well known in the art. The ribozymes can
also be prepared from a DNA molecule (that upon transcription yields an RNA
molecule) operably linked to an RNA polymerase promoter, e.g., the promoter for T7
RNA polymerase or SP6 RNA polymerase. When the vector also contains an RNA
polymerase promoter operably linked to the DNA molecule, the ribozyme can be
produced in vitro upon incubation with RNA polymerase and nucleotides. In a separate
embodiment, the DNA can be inserted into an expression cassette or transcription
cassette. After synthesis, the RNA molecule can be modified by ligation to a DNA
molecule having the ability to stabilize the ribozyme and make it resistant to RNase.
Alternatively, the ribozyme can be modified to the phosphothio analog for use in
liposome delivery systems. This modification also renders the ribozyme resistant to
endonuclease activity.

dsRNA (RNAi) is particularly useful for specifically inhibiting the production of a
particular protein. This technology relies on the presence of dsRNA molecules that
contain a sequence that is essentially identical to the mRNA of the gene of interest, in
this case an mRNA encoding the metalloprotease. Conveniently, the dsRNA is produced
in a single open reading frame in a recombinant vector or host cell, where the sense
and anti-sense sequences are flanked by an unrelated sequence which enables the
sense and anti-sense sequences to hybridize to form the dsRNA molecule with the
unrelated sequence forming a loop structure. The design and production of suitable
dsRNA molecules for genetic manipulation is well within the capacity of a person skilled
in the art, particularly considering Waterhouse et al. (1998), Elbashir et al. (2001), WO
99/32619, WO 99/53050, WO 99/49029, and WO 01/34815.
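
The arrangement described above, a sense arm and an anti-sense arm separated by an unrelated sequence that forms the loop, can be sketched as follows. The function names, the arm and the spacer are hypothetical placeholders, and the spacer is assumed here to sit between the two arms; a real construct would use a target-specific arm of suitable length chosen according to the cited references.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Reverse complement of a DNA strand written 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def hairpin_insert(sense_arm, spacer):
    """DNA insert whose transcript can fold back on itself.

    Layout: sense arm, unrelated spacer (the loop), anti-sense arm.
    """
    return sense_arm.upper() + spacer.upper() + reverse_complement(sense_arm)


def arms_are_complementary(insert, arm_length):
    """Check that the last arm_length bases pair with the first arm_length bases."""
    return reverse_complement(insert[:arm_length]) == insert[-arm_length:]
```

For instance, `hairpin_insert("ATGC", "TTTT")` returns `ATGCTTTTGCAT`, in which the terminal `GCAT` can pair with the leading `ATGC`.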

Owing to the genetic manipulation, the host cell of the invention expresses significantly
reduced levels of the metalloprotease. In a preferred embodiment, the level of
metalloprotease expressed by the host cell is reduced by more than about 25%, such as
more than about 30%, such as more than about 35%, such as more than about 40%,
such as more than about 45%, such as more than about 50%, such as more than
about 55%, such as more than about 60%, such as more than about 65%, such as
more than about 70%, such as more than about 75%, such as more than about 80%,
such as more than about 85%, such as more than about 90%, such as more than
about 95%, such as more than about 98%, and such as more than about 99%.

In a most preferred embodiment, the product expressed by the host cell is essentially
free of any activity of the defined metalloprotease.

In the present context, the term "essentially free" relates to a host cell wherein the
metalloprotease expressed by said host cell is reduced to a level where the function of
said metalloprotease has no biologically significant reducing influence on the production
of the protein of interest.
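
The percentage reductions referred to above are relative to the unmodified parent cell. A minimal sketch of the arithmetic, assuming that metalloprotease level or activity has been measured in the same arbitrary units for both cells (the function name and the numbers in the example are hypothetical):

```python
def percent_reduction(parent_level, modified_level):
    """Reduction of the modified cell's level relative to the parent, in percent."""
    if parent_level <= 0:
        raise ValueError("parent level must be positive")
    return 100.0 * (parent_level - modified_level) / parent_level
```

A parent cell measuring 200 units and a modified cell measuring 50 units would thus correspond to a 75% reduction.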

Protein of Interest 

The terms "polypeptide", "protein" and "peptide" are used herein interchangeably and
in the present context relate to any of the numerous naturally occurring extremely
complex substances such as but not limited to enzymes or antibodies that consist of
amino acid residues joined by peptide bonds, contain the elements carbon, hydrogen,
nitrogen, oxygen, usually sulphur, and occasionally other elements such as but not
limited to phosphorus or iron, that are essential constituents of all living cells, that are
in nature synthesised from raw materials by plants but assimilated as separate amino
acids by animals, that are both acidic and basic and usually colloidal in nature although
many have been crystallised, and that are hydrolyzable by acids, alkalies, proteolytic
enzymes, and putrefactive bacteria to polypeptides, to simpler peptides, and ultimately
to alpha-amino acids.

As defined herein, a "recombinant polypeptide" is a protein which is not native to the
host cell, or a native polypeptide in which modifications have been made to alter the
native sequence, or a native protein whose expression is quantitatively altered as a
result of a manipulation of a native regulatory sequence required for the expression of
the native protein, such as a promoter, a ribosome binding site, etc., or other
manipulation of the host cell by recombinant DNA techniques.

Owing to the absence or reduction in activity of the defined metalloprotease, at least a
portion of the recombinant polypeptides expressed by the host cell may also be a
precursor protein, i.e. a zymogen, a hybrid protein, a protein obtained as a pro
sequence or pre-pro sequence, or in unmatured form.

In a more specific embodiment, the recombinant polypeptide is of eucaryotic origin,
such as insulin, adrenocorticotropic hormones, angiotensinogen, atrial natriuretic
peptides, dynorphin, endorphins, galanin, gastrin, gastrin releasing peptides,
neuropeptide Y fragments, pancreastatin, pancreatic polypeptides, secretin, vasoactive
intestinal peptide, growth hormone releasing factor, melanocyte stimulating hormone,
neurotensin, adrenal peptide, parathyroid hormone and related peptides, somatostatin
and related peptides, brain natriuretic peptide, calcitonin, corticotropin releasing factor
(CRF), cocaine amphetamine regulated transcript (CART), thymosin, urotensin,
glucagon and glucagon like peptides (GLP-1 and GLP-2), somatostatin, interferon, a
vascular endothelial growth factor (VEGF), platelet derived growth factor (PDGF), factor
VII, factor VIII, factor V, factor IX, interleukins, urokinase, erythropoietin (EPO),
chymosin, tissue plasminogen activator, CCK or serum albumin.

With specific regard to "glucagon and glucagon like peptides", this term as used herein
may refer to polypeptides of human origin or from other animals and recombinant or
semisynthetic sources and includes all members of the glucagon family, such as GRPP
(glicentin-related polypeptide), glucagon, GLP-1 (glucagon like peptide 1), and GLP-2
(glucagon like peptide 2), including truncated and/or N-terminally extended forms,
such as GLP-1(7-36), and includes analogues, such as GLP-1(7-35)R36A, GLP-2 F22Y,
GLP-2 A19T+34Y, GLP-2 A2G and GLP-2 A19T, and other analogues having from 1 to 3
amino acid changes, additions and/or deletions.

Host Cells and the Expression of Recombinant Polypeptides Therefrom 

The host cells for use in the present invention can be procaryotic or eucaryotic. The 
eucaryotic host cells for use in the present invention can be, for example, fungal, 
mammalian, plant or insect cells. Preferably, the host cells are yeast cells. 

In order to produce the desired polypeptide, the host cell of the invention comprises a
nucleic acid sequence encoding the recombinant polypeptide as well as regulatory
sequences for directing the expression of the desired product, such as regions
comprising nucleotide sequences necessary for e.g. transcription, translation and
termination. The genetic design of the host cell of the invention may be accomplished
by the person skilled in the art, using standard recombinant DNA technology for the
transformation or transfection of a host cell.

Preferably, the host cell is modified by methods known in the art for the introduction of
an appropriate expression cassette in, for example, a plasmid or a viral vector,
comprising the nucleic acid encoding the recombinant polypeptide. The expression
cassette may be maintained in the host cell in a number of ways including, but
not limited to, as an autonomously replicating plasmid or integrated into the
chromosome.

Expression cassettes may contain regulatory sequences such as transcription control
sequences, translation control sequences, origins of replication, and other regulatory
sequences that are compatible with the recombinant cell and that control the
expression of nucleic acid molecules encoding the recombinant polypeptide. In
particular, recombinant nucleic acid molecules of the present invention include
transcription control sequences. Transcription control sequences are sequences which
control the initiation, elongation, and termination of transcription. Particularly important
transcription control sequences are those which control transcription initiation, such as
promoter, enhancer, operator and repressor sequences. Suitable transcription control
sequences include any transcription control sequence that can function in at least one
of the recombinant cells of the present invention. A variety of such transcription control
sequences are known to those skilled in the art. Preferred transcription control
sequences include those which function in bacterial, yeast, arthropod and mammalian
cells, such as, but not limited to, tac, lac, trp, trc, oxy-pro, omp/lpp, rrnB,
bacteriophage lambda, bacteriophage T7, T7lac, bacteriophage T3, bacteriophage SP6,
bacteriophage SP01, metallothionein, alpha-mating factor, Pichia alcohol oxidase,
alphavirus subgenomic promoters (such as Sindbis virus subgenomic promoters),
antibiotic resistance gene, baculovirus, Heliothis zea insect virus, vaccinia virus,
herpesvirus, raccoon poxvirus, other poxvirus, adenovirus, cytomegalovirus (such as
immediate early promoters), simian virus 40, retrovirus, actin, retroviral long
terminal repeat, Rous sarcoma virus, heat shock, phosphate and nitrate transcription
control sequences as well as other sequences capable of controlling gene expression in
prokaryotic or eukaryotic cells. Transcription control sequences of the present invention
are most preferably naturally occurring transcription control sequences associated with
yeast. Suitable promoters for S. cerevisiae include the MFα1 promoter, galactose
inducible promoters such as GAL1, GAL7 and GAL10 promoters, glycolytic enzyme
promoters including TPI1 and PGK1 promoters, TRP1 promoter, CYC1 promoter, CUP1
promoter, PHO5 promoter, ADH1 promoter, and HSP promoter. A suitable promoter in
the genus Pichia is the AOX1 (methanol utilisation) promoter.

Recombinant polypeptides of the present invention may also (a) contain secretory
signals to enable an expressed polypeptide to be secreted from the cell that produces
the polypeptide and/or (b) contain fusion sequences which lead to the expression of
fusion proteins. Examples of suitable signal segments include any signal segment
capable of directing the secretion of the fusion protein. Preferred signal segments
include, but are not limited to, tissue plasminogen activator (t-PA), interferon,
interleukin, growth hormone, histocompatibility and viral envelope glycoprotein signal
segments, the leader sequence originating from the fungal amyloglucosidase (AG) gene
such as glaA (both 18 and 24 amino acid versions, e.g. from Aspergillus sp.), the
α-factor gene of yeasts e.g. from Saccharomyces sp. and Kluyveromyces sp., the P-factor
of Schizosaccharomyces sp., and the α-amylase gene from Bacillus sp., as well as
natural signal sequences.

The cloning vehicle may also comprise a selectable marker, e.g. a gene, the product of
which complements a defect in the host cell, or one which confers antibiotic resistance,
such as ampicillin, kanamycin, chloramphenicol or tetracycline resistance.

The procedures used to ligate the DNA construct of the invention, the promoter,
terminator and other elements, respectively, and to insert them into suitable cloning
vehicles containing the information necessary for replication, are well known to persons
skilled in the art.

Recombinant DNA technologies can be used to improve the expression of transformed
nucleic acid molecules by manipulating, for example, the number of copies of the
nucleic acid molecules within a host cell, the efficiency with which those nucleic acid
molecules are transcribed, the efficiency with which the resultant transcripts are
translated, and the efficiency of post-translational modifications. Recombinant
techniques useful for increasing the expression of nucleic acid molecules useful for the
methods of the present invention include, but are not limited to, operably linking the
nucleic acid molecule to high-copy number plasmids, integration of the nucleic acid
molecule into one or more host cell chromosomes, addition of vector stability
sequences to plasmids, substitutions or modifications of transcription control signals
(e.g., promoters, operators, enhancers), substitutions or modifications of translational
control signals (e.g., ribosome binding sites, Shine-Dalgarno sequences), modification
of the nucleic acid molecule to correspond to the codon usage of the host cell, and the
deletion of sequences that destabilize transcripts. The activity of an expressed
recombinant polypeptide of the present invention may be improved by fragmenting,
modifying, or derivatizing polynucleotide molecules encoding such a protein.

Methods of Producing Recombinant Polypeptides

Host cells that have been transfected or transformed with the nucleic acid encoding the
recombinant polypeptide are cultured for a sufficient time and under appropriate
conditions known to those skilled in the art in view of the teachings disclosed herein to
permit the production, and preferably secretion, of the polypeptide, followed by
recovery of the desired product.

Furthermore, owing to the reduced activity of the metalloprotease, the recombinant
polypeptide expressed by the host cell may be obtained as a precursor protein, i.e. a
zymogen, a hybrid protein, a protein obtained as a pro sequence or pre-pro sequence,
or in unmatured form.

The broth or medium used for culturing may be any conventional medium suitable for
growing the host cell in question, and may be composed according to the principles of
the prior art. The medium preferably contains carbon and nitrogen sources and other
inorganic salts. Suitable media, e.g. minimal or complex media, are available from
commercial suppliers, or may be prepared according to published protocols.

With regard to yeast host cells, it is often advantageous to produce heterologous
polypeptides in a diploid yeast culture, because possible genetic defects may become
phenotypically expressed in a haploid yeast culture, e.g. during continuous
fermentation in production scale, and because the yield may be higher. The production
of recombinant polypeptides in yeast host cells is described in Molecular Genetics of
Yeast: A Practical Approach, Ed. J.R. Johnston, IRL Press (1994) which is incorporated
herein by reference.

After cultivation, the protein is recovered by conventional methods for isolation and
purification of proteins from a culture broth. Well-known purification procedures include
separating the cells from the medium by centrifugation or filtration, precipitating
proteinaceous components of the medium by means of a salt such as ammonium
sulphate, and chromatographic methods such as ion exchange chromatography,
gel filtration chromatography, affinity chromatography, etc.

Examples

Materials and Methods 

Yeast strains and growth conditions 

The yeast strains used are listed in Table 1. Construction of strains was carried out
using either the two-step gene disruption technique (Rothstein, 1991) or the PCR-based
method of Brachmann et al. (1998). Media were purchased from Difco, amino acids
and other supplements from Sigma-Aldrich. Yeast cells were grown at 30°C in YPD (1%
yeast extract, 2% peptone and 2% dextrose) or synthetic complete media (SC) based
on yeast nitrogen base with ammonium sulfate, succinic acid, NaOH and appropriate
amino acids. Transformations with either linear DNA or plasmids were performed using
the modified lithium acetate procedure as described (Gietz et al., 1995). Analysis of
heterologously expressed CCK was performed on yeast growing in exponential phase
because CCK-22 biosynthesis was consistent in that phase, in contrast to the results
from yeast in stationary phase (Fig. 2). ProCCK processing was analysed from cell extract
and media of 5 A600 units of cells per ml synthetic complete media. Cell growth was
followed by the absorbance at 600 nm.

Table 1. S. cerevisiae strains used in this study. Null mutants of putative
metalloproteases are named by the ORF in the genotype and (*) represents
mitochondrial proteases.
Strain | Genotype | Source
BY4705 | MATa ade2Δ::hisG his3Δ200 leu2Δ0 lys2Δ0 ura3Δ0 | (Brachmann et al., 1998)
LJY13 | MATa ade2Δ::hisG his3Δ200 leu2Δ0 lys2Δ0 ura3Δ0 yps1::TRP1 | This study
LJY14 | MATa ade2Δ::hisG his3Δ200 leu2Δ0 lys2Δ0 ura3Δ0 yps1::TRP1 yps3::LEU2 | This study
LJY15 | MATa ade2Δ::hisG his3Δ200 leu2Δ0 lys2Δ0 ura3Δ0 yps1::TRP1 yps3::LEU2 yps2::URA3 | This study
BJ2168 | MATa prc1-407 prb1-1122 pep4-3 leu2 trp1 ura3-52 gal2 | (Jones, 1991)
LJY21 | MATa prc1-407 prb1-1122 pep4-3 leu2 trp1 ura3-52 gal2 kex1::LEU2 | This study
LJY22 | MATa prc1-407 prb1-1122 pep4-3 leu2 trp1 ura3-52 gal2 kex1::LEU2 kex2::TRP1 | This study
LJY23 | MATa prc1-407 prb1-1122 pep4-3 leu2 trp1 ura3-52 gal2 kex2::TRP1 | This study
LJY122 | MATa ape1::KANMX ape2::LYS2 his3Δ0 leu2Δ0 lys2Δ0 ura3Δ0 | This study
LJY123 | MATa ape1::KANMX ape2::LYS2 ape3::LEU2 his3Δ0 leu2Δ0 lys2Δ0 ura3Δ0 | This study
LJY201 | MATa prc1-407 prb1-1122 pep4-3 leu2 trp1 ura3-52 gal2 axl1::LEU2 | This study
LJY202 | MATa prc1-407 prb1-1122 pep4-3 leu2 trp1 ura3-52 gal2 ste24::LEU2 | This study
LJY203 | MATa prc1-407 prb1-1122 pep4-3 leu2 trp1 ura3-52 gal2 prd1::LEU2 | This study
LJY204 | MATa prc1-407 prb1-1122 pep4-3 leu2 trp1 ura3-52 gal2 yil108w::LEU2 | This study
Y15298 | MATa his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0 ste23::KANMX | Euroscarf
Y11874 | MATa his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0 aap1::KANMX | Euroscarf
Y10148 (*) | MATa his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0 afg3::KANMX | Euroscarf
Y14953 | MATa his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0 ape1::KANMX | Euroscarf
Y16224 (*) | MATa his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0 rca1::KANMX | Euroscarf
Y14984 (*) | MATa his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0 mip1::KANMX | Euroscarf
Y17144 | MATa his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0 yme1::KANMX | Euroscarf
Y13211 | MATa his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0 ybr074w::KANMX | Euroscarf
Y13801 | MATa his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0 ydl104c::KANMX | Euroscarf
Y11941 | MATa his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0 yhr113w::KANMX | Euroscarf
Y11960 | MATa his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0 yhr132c::KANMX | Euroscarf
Y12296 | MATa his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0 yil137c::KANMX | Euroscarf
Y15370 | MATa his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0 ynl045w::KANMX | Euroscarf
10864B | MATa ura3-Δ851 leu2-Δ1 his3Δ200 lys2Δ202 ykr035c-ykr03Sc::URA3 | Euroscarf
Y11749 | MATa his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0 yol057w::KANMX | Euroscarf
Y16248 | MATa his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0 yol098c::KANMX | Euroscarf
10231B | MATa his3Δ200 leu2Δ1 trp1Δ63 ura3-52 yol154w(4.744)::KANMX | Euroscarf
Y14266 | MATa his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0 ydr430c::KANMX | Euroscarf
LJY430 | MATa prc1-407 prb1-1122 pep4-3 leu2 trp1 ura3-52 gal2 ydr430c::LEU2 | This study
LJY432 | MATa prc1-407 prb1-1122 pep4-3 leu2 trp1 ura3-52 gal2 kex2::TRP1 ydr430c::LEU2 | This study

DNA extraction and amplification

Yeast genomic DNA was isolated as described (Philippsen et al., 1991). Polymerase
chain reaction (PCR) was performed using either Pwo polymerase or the enzyme
cocktail based on Taq, Pwo and Pfu polymerase (Expand long range PCR kit, XL-PCR),
both from Roche. All PCR products were visualised by agarose gel electrophoresis and
purified either from the gel using the gel-extraction kit (Qiagen) or from
the reaction mixture by PCR purification spin columns (GENOMED). PCR-based one-step
gene disruption was performed using 50 ng of plasmid from the pRS400 series
(Brachmann et al., 1998) as template. Amplification of the marker was performed with
oligonucleotides having 20 nucleotides towards the plasmid and an additional 50
nucleotides flanking the target gene (Table 2). All other DNA manipulations were
carried out according to standard procedures (Sambrook et al., 1989).
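
The primer layout used for the one-step disruption, 50 nucleotides of target-gene flank followed by 20 nucleotides that anneal to the marker plasmid, can be sketched as below. This is an illustration under assumptions: the flanks are taken immediately outside the ORF, coordinates are 0-based and half-open on the top strand, and all function and parameter names are hypothetical. The primers actually used are those listed in Table 2.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Reverse complement of a DNA strand written 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def disruption_primers(locus, orf_start, orf_end, anneal_fwd, anneal_rev, flank=50):
    """Design a primer pair for PCR-based one-step gene disruption.

    locus       top-strand sequence containing the target ORF and its flanks
    orf_start   0-based index of the first ORF base
    orf_end     0-based index just past the last ORF base
    anneal_fwd  3' part of the forward primer, annealing to the marker plasmid
    anneal_rev  3' part of the reverse primer, already in primer orientation
    """
    if orf_start < flank or orf_end + flank > len(locus):
        raise ValueError("locus does not contain enough flanking sequence")
    upstream = locus[orf_start - flank:orf_start].upper()
    downstream = locus[orf_end:orf_end + flank]
    forward = upstream + anneal_fwd.upper()
    reverse = reverse_complement(downstream) + anneal_rev.upper()
    return forward, reverse
```

With 50-nucleotide flanks and 20-nucleotide annealing segments, each primer is 70 nucleotides long, and the amplified marker carries target-gene homology at both ends for recombination.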

Plasmid constructions

Expression of proCCK was performed in pRS426 [2\x URA3] (Brachmann et al., 1998) 
using the phosphoglycerate kinase promoter {PGKlp) and terminator (PGKlt). The 
PGK1 promoter was amplified with PGKlp 5'H/ndIII and PGKlp 3'MCS (Table 2) using 

20 100 ng of genomic yeast DNA as template and subsequently cloned into pGEM-ll 
(Promega) in the HfndUl and SacI restriction enzyme sites. The terminator was 
amplified with PGKlt S'BglU and PGKlt 3'SacI (Table 2) and ligated into the plasmid 
containing the promoter at the SacI and EcoRI restriction enzyme sites. This construct, 
pGEM-ll PGKlpMCSPGKlt then contained the PG/CI-promoter, a multiple cloning site 

25 {MCS) with the restriction enzyme sites EcoRI, BamHl, Xbal and Bgfll followed by the 
PGK1 terminator. The preproMfalp-proCCK fusion (Rourke et al., 1997) (Fig. 1) was 
subcloned into the EcoRI and Xbal sites of pGEM-ll PGKlpMCSPGKlt and finally the 
entire gene was cloned into pRS423 as well as pRS426 to complete the yeast CCK 
expression constructs, pRS423 preproMfctlp-proCCK and pRS426 preproMfctlp-proCCK 

respectively. Expression of CYM1 on a multicopy plasmid was constructed by
amplification of the open reading frame (ORF) of CYM1 and an additional 926 bp at the
5' end and 703 bp at the 3' end. The amplification was carried out by XL-PCR using 100
ng of genomic yeast DNA and the oligonucleotides CYM1 5'ApaI and CYM1 3'XhoI
(Table 2). The PCR product was purified on spin columns and subsequently cloned into
the ApaI and XhoI restriction enzyme sites of pRS425.

Table 2. Oligonucleotides used.

Oligo | Oligonucleotide sequence (5'-3') | Purpose
PGK1p 5'HindIII | AATACAAGCTTGTCGACTGATCTATCCAAAACTG (SEQ ID NO: 16) | Expression construct
PGK1p 3'MCS | AAAAGAGCTCGGCCAGATCTTCTAGAGGATCCAAGAATTCTGTTTTATATTTGTTGTAAAAAGTAG (SEQ ID NO: 17) | Expression construct
PGK1t 5'BglII | TTTTGAATTCCAAGATCTCCCATGTCTCTACTGGTGG (SEQ ID NO: 18) | Expression construct
PGK1t 3'SacI | CCCCGAGCTCGTCGACCCTTCTCGAAAGCTTTAACGAACGC (SEQ ID NO: 19) | Expression construct
 | TTTTGAATTCAAAGAATGAGATTTCCTTCAATTTTTACTGCAG (SEQ ID NO: 20) | preproMfα1p-proCCK
CCK 3'XbaI | TTTTTCTAGACTAGGAGGGGTACTCATACTCCTCGGC (SEQ ID NO: 21) | preproMfα1p-proCCK
CCK-22 K->A (S) | CGAATGTCCATCGTTGCGAACCTGCAGAACCTG (SEQ ID NO: 22) | Lys61->Ala61 mutation
CCK-22 K->A (AS) | CAGGTTCTGCAGGTTCGCAACGATGGACATTCG (SEQ ID NO: 23) | Lys61->Ala61 mutation
CCK-22 K->R (S) | CGAATGTCCATCGTTAGGAACCTGCAGAACCTG (SEQ ID NO: 24) | Lys61->Arg61 mutation
CCK-22 K->R (AS) | CAGGTTCTGCAGGTTCCTAACGATGGACATTCG (SEQ ID NO: 25) | Lys61->Arg61 mutation
CCK-22 seq | TCGCAGAGAACGGATGGC (SEQ ID NO: 26) | Sequencing
CYM1 5'ApaI | TTTTGGGCCCTTCATGGTGATACGGTATCTCTTGGC (SEQ ID NO: 27) | Cloning of CYM1
CYM1 3'XhoI | TTTTCTCGAGAAGGTGGAACATACTGCCCTGGGATGG (SEQ ID NO: 28) | Cloning of CYM1
KEX2 5' | TTTTGAGCTCGTTTAGGAAACGTCCTTGGCGGAGATGC (SEQ ID NO: 29) | Cloning of KEX2
KEX2 3' | TTTTTCTAGACACTGCGAATCCATGGTATAAACCAAAACC (SEQ ID NO: 30) | Cloning of KEX2
KEX2 DC5' | GTCGTTGTTCATGGACATACCTCC (SEQ ID NO: 31) | Control of Δkex2
KEX2 DC3' | TACAAATGTTCTTCTGCCATTTCTGG (SEQ ID NO: 32) | Control of Δkex2
TRP1 5'NdeI | GGTTCATATGCGCCGGAGCTCCTCGACAGCAG (SEQ ID NO: 33) | Cloning of TRP1
TRP1 3'AvrII | GGTTCCTAGGATCCGCAAGTTTGATTCCATTGCGGTG (SEQ ID NO: 34) | Cloning of TRP1
KEX1 5'GD400 | TTAAAGAGTACCTTGGCTATAGAATACCGTAGAGATAAAGACCTGAATAGAGATTGTACTGAGAGTGCAC (SEQ ID NO: 35) | KEX1 deletion
KEX1 3'GD400 | AGGTATTATAACTATTTTTCTGTATTTTTTATATATTTTTATT^GCCAAGCTGTGCGGTATTTCACACCG (SEQ ID NO: 36) | KEX1 deletion
KEX1 5'DC400 | CTTTGGTTAAAGAGTACCTTGGC (SEQ ID NO: 37) | Control of Δkex1
KEX1 3'DC400 | TACTACGAAAAGCGTGTGCGAGG (SEQ ID NO: 38) | Control of Δkex1
CYM1 5'GD400 | TAGAAGGCTACTCAAAAGAATAAAGTTACTATAAAATATACTGCGGTATATAGATTGTACTGAGAGTGCAC (SEQ ID NO: 39) | CYM1 deletion
CYM1 3'GD400 | GATCGGCAAGAAACTTTCAAGCAGTATATTTACAGGATTAAATTATATATCTGTGCGGTATTTCACACCG (SEQ ID NO: 40) | CYM1 deletion
CYM1 5'DC400 | CGGAGGGGCTCTATGATAAAGG (SEQ ID NO: 41) | Control of Δcym1
CYM1 3'DC400 | GAGTAACTAGGGCTTCTCTTCCC (SEQ ID NO: 42) | Control of Δcym1

The Lys61 residue, believed to be crucial for the proteolysis of proCCK to release
CCK-22, was replaced by Ala by site-directed mutagenesis (Horton et al., 1993). The
exchange was performed by PCR using the Pwo polymerase (Boehringer Mannheim),
where two products were amplified with the oligonucleotide sets PGK1p 5'HindIII /
CCK-22 K->A (antisense) and CCK-22 K->A (sense) / PGK1t 3'SacI (Table 2), with 50 ng
of pRS426 preproMfα1p-proCCK as template in each reaction. The two products were
subjected to agarose gel electrophoresis, and approximately 1 mm² of each product
was cut out and used directly as template in a third PCR reaction. In this reaction the
full-length cDNA encoding the fusion protein was amplified using PGK1p 5'HindIII and
PGK1t 3'SacI (Table 2). The PCR product was subcloned into pCR-Blunt II (Invitrogen)
and sequenced with the CCK-specific primer, CCK-22 seq. Finally the PGK1p
preproMfα1p-proCCK (K->A) PGK1t product was cloned into the HindIII and SacI sites
of pRS426 to construct the expression plasmid, proCCK (K->A). Substitution of Lys with
Arg was performed as described above by exchanging the CCK-specific primers with
CCK-22 K->R (antisense) and CCK-22 K->R (sense) to construct the proCCK (K->R)
vector.
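
The relationship between the sense and antisense mutagenesis primers can be checked with a short Python sketch. It takes the CCK-22 K->A sense primer (SEQ ID NO: 22) and the CCK-22 K->R antisense primer (SEQ ID NO: 25) from Table 2, reverse-complements the antisense primer, and compares the codon at the substituted position. The helper name and the assumption that nucleotides 16 to 18 of the sense primers form the in-frame codon for residue 61 are illustrative and not taken from the specification.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


K_TO_A_SENSE = "CGAATGTCCATCGTTGCGAACCTGCAGAACCTG"      # SEQ ID NO: 22
K_TO_R_ANTISENSE = "CAGGTTCTGCAGGTTCCTAACGATGGACATTCG"  # SEQ ID NO: 25

# Sense-strand reading of the K->R primer pair.
k_to_r_sense = reverse_complement(K_TO_R_ANTISENSE)

# Assumed codon window for residue 61 (0-based slice 15:18 of the sense primer).
CODON = slice(15, 18)
ala_codon = K_TO_A_SENSE[CODON]   # GCG encodes Ala in the standard genetic code
arg_codon = k_to_r_sense[CODON]   # AGG encodes Arg in the standard genetic code

# Positions at which the two sense-strand primers differ.
differences = [i for i, (a, b) in enumerate(zip(K_TO_A_SENSE, k_to_r_sense)) if a != b]
print(k_to_r_sense, ala_codon, arg_codon, differences)
```

Under these assumptions the two primer pairs differ only within the codon window, which is what one would expect for primers that introduce alternative substitutions at a single residue.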

Strain construction

Construction of a partial KEX2 disruption was performed in BJ2168 by amplification of
the entire KEX2 gene with 1000 bp on each side of the ORF by XL-PCR using 100 ng of
genomic yeast DNA and the oligonucleotides KEX2 5'SacI and KEX2 3'XbaI (Table 2).
The PCR product was purified on spin columns and cloned into pCR-Blunt II
(Invitrogen). Amplification of TRP1 was performed by XL-PCR, introducing an NdeI site
925 bp 5' to the ORF and an AvrII site 212 bp 3' to the stop codon using TRP1 5'NdeI
and TRP1 3'AvrII (Table 2). The PCR product was purified and subcloned into the NdeI
and AvrII sites of KEX2, eliminating 2018 bp of KEX2 and 170 bp of the promoter. The
kex2::TRP1 construct was excised from pCR-Blunt II using the NotI and SpeI restriction
enzymes and subsequently transformed into BJ2168. Transformants were selected on
SC-Trp plates followed by a colony PCR screen to test for correct integration using
oligonucleotides that cover the entire marker plus an additional 1200 bp on each side of
KEX2, kex2 DC5' and kex2 DC3' (Table 2). Construction of a kex2 kex1 strain was
performed by the two-step gene disruption technique (Rothstein, 1991) using the LEU2
marker. Amplification of LEU2 was performed by XL-PCR using 50 ng of pRS405 as
template and kex1 5'GD400 and kex1 3'GD400 (Table 2). The PCR product was purified
using a PCR purification spin kit (GENOMED) and subsequently transformed into UY23.
Transformants were selected on SC-Leu plates and correct integration was tested by a
PCR-based colony screen using kex1 5'DC and kex1 3'DC (Table 2).

A Δcym1::LEU2 (UY430) strain in a BJ2168 background was constructed by the
one-step gene disruption technique as described above for the Δkex1 strain using the
oligonucleotides CYM1 5'GD400 and CYM1 3'GD400 (Table 2) for gene disruption and
cym1 5'DC and cym1 3'DC (Table 2) for disruption control. All null mutants created by
this method were prepared with oligonucleotides designed towards the 50 bases
adjacent to the 5' and 3' UTR with a specific 3' end to the pRS400 series of vectors
containing various markers (Brachmann et al., 1998). Transformants were selected on
appropriate agar plates followed by a colony PCR screen to test for correct integration
using oligonucleotides that cover the entire marker plus an additional 200 bp on each
side. Only the oligonucleotides that are not positioned as described above are shown in
Table 2.

Gene deletions of STE24, AXL1, PRD1 and YIL108w were made in BJ2168 using the
PCR disruption technique (Brachmann et al., 1998) and pRS405 [LEU2] as template.

The UY123 strain, which contains gene deletions of APE1, -2 and -3, was derived from
Y14953 using the PCR disruption technique (Brachmann et al., 1998). APE2 was initially
replaced with the LYS2 marker (pRS317 [cen; LYS2]), where the PCR product was
purified from agarose gel prior to transformation, and APE3 was substituted with the
LEU2 marker (pRS405 [LEU2]).

The yps1 yps2 yps3 triple mutant (UY15) was constructed in BY4705 using the PCR
disruption technique (Brachmann et al., 1998). The ORF of YPS1 was initially deleted
by insertion of the TRP1 locus (pRS404) to generate UY13. This strain was then used
as host for the deletion of YPS3 by insertion of the LEU2 marker (pRS405), and finally
YPS2 was deleted by insertion of the URA3 marker by amplification of pRS406
[URA3] to construct UY15 (Table 1).

CCK and CYM1 expression

Human proCCK was expressed as a fusion protein between the prepro leader sequence
of yeast α-mating factor and proCCK (preproMfα1p-proCCK). The fusion construct was
expressed on multicopy plasmids, with constitutive gene transcription from the
phosphoglycerate kinase promoter. "ProCCK expression" refers to expression using
pRS426 preproMfα1p-proCCK, which was used in all yeast strains with the exception of
BY4705 and the isogenic yapsin deletion strains, where proCCK was expressed from
pRS423 preproMfα1p-proCCK. CYM1 expression was driven by its own promoter.
Plasmid constructs and oligonucleotides used are listed in Table 2.

Enzymatic treatment

Trypsin treatment was performed using 1 mg/ml trypsin (Worthington Biochemical
Corporation) in 50 mM sodium phosphate buffer (pH 7.5) for 30 min at room
temperature and terminated by immersion in boiling water for 10 min. Carboxypeptidase
B (Boehringer Mannheim) treatment at a final concentration of 4 µg/ml was performed
in 0.1 mM sodium phosphate buffer (pH 7.5) at room temperature for 30 min. The
reaction was terminated by immersion in boiling water for 10 min.

Gel chromatography

Yeast transformants grown to late exponential phase were centrifuged at 15000 g to
collect the cells, and 500 µl of the medium was loaded directly onto a Sephadex G-50
superfine (Pharmacia) column (1 x 100 cm) at 4°C. The sample was eluted in VBA buffer
(20 mM barbital buffer, 0.11% bovine serum albumin and 0.6 mM thiomersal) at a flow
rate of 3.5 ml/h and fractions were collected every 17 min. Calibrations were performed
by including 125I-albumin (V0) and 22NaCl (Vt). The elution constant, Kd, of a peak
eluting at Ve is calculated as Kd = (Ve - V0)/(Vt - V0).
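
The elution constant defined above can be computed as in the following minimal Python sketch. The function names and the example volumes are hypothetical and serve only to illustrate the formula; the fraction volume follows from the stated flow rate of 3.5 ml/h and the 17 min collection interval.

```python
def elution_constant(v_e: float, v_0: float, v_t: float) -> float:
    """Kd = (Ve - V0) / (Vt - V0), with all volumes in the same unit."""
    if v_t <= v_0:
        raise ValueError("Vt (22NaCl marker) must be larger than V0 (125I-albumin marker)")
    return (v_e - v_0) / (v_t - v_0)


def fraction_volume_ml(flow_ml_per_h: float = 3.5, minutes_per_fraction: float = 17) -> float:
    """Volume collected per fraction at the stated flow rate and interval."""
    return flow_ml_per_h * minutes_per_fraction / 60


# Hypothetical elution volumes in ml, chosen only to illustrate the calculation.
kd_example = elution_constant(v_e=70.0, v_0=30.0, v_t=80.0)
print(f"Kd = {kd_example:.2f}, fraction volume = {fraction_volume_ml():.2f} ml")
```

A peak that elutes later than the 22NaCl marker gives Kd above 1 with this definition, so the function does not restrict Ve to the interval between V0 and Vt.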

Radioimmunoassay

Two different antisera were used to determine the amount of processed
cholecystokinin. Ab 89009 (Paloheimo et al., 1994) is specific for the N-terminus of
CCK-22 and Ab 7270 (Hilsted et al., 1986) is specific for Gly-extended CCK. The
fraction of CCK processed to CCK-22 is calculated by dividing the immunoreactivity
measured with Ab 89009 by the amount measured with the same antibody after the
sample was treated with trypsin to measure the total amount of N-terminally extended
CCK-22.
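
The division described above can be written as a small Python helper. This is a sketch only: the function name is hypothetical and the example readings are invented for illustration, not measured values.

```python
def cck22_fraction(ab89009_untreated: float, ab89009_after_trypsin: float) -> float:
    """Fraction processed to CCK-22: reading before trypsin / reading after trypsin."""
    if ab89009_after_trypsin <= 0:
        raise ValueError("the reading after trypsin treatment must be positive")
    if ab89009_untreated < 0:
        raise ValueError("immunoreactivity cannot be negative")
    return ab89009_untreated / ab89009_after_trypsin


# Hypothetical readings in arbitrary but identical concentration units.
example_fraction = cck22_fraction(ab89009_untreated=2.3, ab89009_after_trypsin=10.0)
print(f"{example_fraction:.0%} processed to CCK-22")
```

Both readings must be expressed in the same unit, since only their ratio is reported.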

Yeast extract and protease assay

Ten A600 units of yeast cells growing in exponential phase were sedimented by
centrifugation at 3000 g for 5 min, washed once in 25 ml H2O and transferred to a 2 ml
Eppendorf tube. An equal amount of acid-washed glass beads (Sigma-Aldrich) was
added followed by 200 µl of 0.1 M NaH2PO4 (pH 4.5) including various inhibitors (150
µM bestatin, 30 µM E-64, 10 µM leupeptin, 1 µM pepstatin A, 1 mM
phenylmethylsulfonyl fluoride (PMSF), 1 mM EDTA and 1 mM 1,10-orthophenanthroline,
or 1 tablet Complete inhibitor with or without EDTA per 2.5 ml 0.1 M NaH2PO4
(Boehringer Mannheim)). The cells were broken by vortexing 3 x 20 sec and the extracts
were clarified by centrifugation at 15000 g for 10 min. All steps were carried out at 4°C.

The protease assay was performed using 20 pmol synthetic amidated CCK-33
(Peninsula Laboratories Europe, Merseyside, England) or Ac-CCK-33-Gly (Cambridge
Research Biochemicals, Stockton, England) as substrate, 20 µl yeast extract, and
various inhibitors and activators in a total volume of 30 µl. The mixture was incubated
at 30°C for 1 h and the reaction terminated by adding 500 µl VBA buffer followed by
immediate immersion in a boiling water bath for 10 min.

Protease assay using metalloprotease deficient strains

The assay was performed as described above, but with addition of 1 mM bestatin and 1
mM Mn2+ to decrease N-terminal degradation.

Protease assay using intact yeast cells

Five A600 units of exponentially growing cells were sedimented, washed once in 5 ml
H2O and once in SC media (pH 6.0), before the cells were resuspended in 25 µl SC
media. The protease assay was performed by addition of 20 pmol synthetic
Ac-CCK-33-Gly as substrate, and the mixture was incubated with gentle shaking at 30°C
for 1 h. The reaction was terminated by addition of 500 µl VBA, and the cells were
removed by centrifugation before the supernatant was immersed in boiling water for 10
min.

Analysis of secreted CCK by MALDI-TOF

Fifty A600 units of CCK-transformed yeast were transferred to 25 ml of fresh media,
followed by incubation for 3 h. Cells were removed by centrifugation at 15000 g for 10
min and 500 µl of media was concentrated and desalted by reverse phase using a
ZipTip C18 column (Millipore). The peptides were eluted with 10 µl 50% acetonitrile. The
purified peptides were analysed in a matrix-assisted laser desorption/ionization
time-of-flight mass spectrometer (Biflex, Bruker-Franzen, Bremen, Germany) operated in
the reflected mode using time lag focusing (delayed extraction). For analysis, 0.5 µl of
the sample was mixed with 0.5 µl matrix solution (α-cyano-4-hydroxycinnamic acid in
acetonitrile/methanol, Hewlett Packard). Then 0.5 µl of the mixture was applied to the
probe and allowed to dry before introduction into the mass spectrometer.

Statistical analysis

Statistical calculations were performed using an unpaired Student's t-test to analyse
whether the change in proCCK expression or the fraction of mature CCK-22 between
wild type yeast expressing proCCK and mutants isogenic to the wild type strain can be
considered statistically significant. The P-values calculated for CCK-22 processing
between yapsin mutants are comparisons of BY4705 and each mutant, whereas the
brackets represent comparisons between BY4705 yps1 and BY4705 yps1 yps3, and
between BY4705 yps1 yps3 and BY4705 yps1 yps2 yps3, respectively.
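
The test statistic of an unpaired Student's t-test can be sketched in Python as follows. The specification does not state which variant or software was used, so the equal-variance (pooled) form is an assumption here, and the replicate values are hypothetical numbers chosen only to show the calculation.

```python
import math
import statistics


def unpaired_t(sample_a, sample_b):
    """Return (t, degrees of freedom) for an equal-variance unpaired t-test."""
    n_a, n_b = len(sample_a), len(sample_b)
    if n_a < 2 or n_b < 2:
        raise ValueError("each group needs at least two observations")
    df = n_a + n_b - 2
    # Pooled variance: weighted average of the two sample variances.
    pooled_var = ((n_a - 1) * statistics.variance(sample_a)
                  + (n_b - 1) * statistics.variance(sample_b)) / df
    standard_error = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    t = (statistics.mean(sample_a) - statistics.mean(sample_b)) / standard_error
    return t, df


# Hypothetical replicate measurements of the CCK-22 fraction (%) in two strains.
wild_type = [28.0, 27.0, 29.0]
mutant = [6.0, 5.0, 7.0]
t_value, dof = unpaired_t(wild_type, mutant)
print(f"t = {t_value:.2f}, df = {dof}")
```

The P-value is then read from the t distribution with the returned degrees of freedom; for 4 degrees of freedom the two-sided 5% critical value is 2.776.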

Results

The influence of growth conditions on the CCK-22 processing

The intra- and extracellular fraction of CCK-22 was measured from BJ2168 expressing
proCCK. The intracellular fraction remained unaltered whether the cells were in
exponential growth or had reached stationary phase (Fig. 2). However, the relative
amount of secreted CCK-22 changed markedly when the cells reached stationary
phase. During exponential growth the fraction of CCK-22 was 23%, but in the
stationary phase (after 270 min) the fraction increased to 37% (Fig. 2). Hence, for the
experiments described herein only exponentially growing cells were used.

The significance of the Lys residue in the release of CCK-22

To evaluate the role of the Lys residue in proteolysis, transformants of BJ2168 with the
two expression constructs, proCCK and proCCK (K->A), were grown to late exponential
phase and the culture media collected. The media from each strain were subjected to gel
chromatography and the content of Gly-extended CCK in the collected fractions was
measured with Ab 7270. CCK from the wild type media eluted in two major peaks at Kd
= 0.8 and 1.1 (Fig. 3 A), in accordance with the previously established elution positions
for CCK-22-Gly and CCK-8-Gly (Cantor et al., 1987; Rourke et al., 1997), while the
proCCK (K->A) only gave rise to CCK-8-Gly and a larger form eluting at Kd = 0.6 (Fig.
3 C). The two peaks eluting at Kd = 0.7 and 0.8 for the wt construct (Fig. 3 B)
correspond to C-terminally extended CCK-22 and CCK-22-Gly, respectively (Rourke et
al., 1997), whereas no CCK-22 immunoreactivity was observed in these positions for
the proCCK (K->A) construct (Fig. 3 D). However, a small peak of immunoreactivity was
seen at Kd = 0.55, which may be due to the slight cross-reactivity of Ab 89009 with a
larger unprocessed fragment (Paloheimo et al., 1994). To investigate the effect of
substituting Arg for Lys, proCCK (K->R) was transformed into BJ2168. Media from
transformants were analysed before and after tryptic cleavage. The fraction of proCCK
processed to CCK-22 was similar to that seen for wild type CCK (Fig. 11).

Analysis of secreted CCK peptides by mass spectrometry

Media collected from BJ2168 transformed with proCCK were analysed by mass
spectrometry (Figure 12). The fragments obtained correspond to the processing leading
to CCK-39, CCK-22 and CCK-8. Two peptides were identified N-terminal of Lys61
(Tyr45-Val60, 1805.0 Da and Tyr45-Lys61, 1932.2 Da). It appeared likely that the former
was a carboxypeptidase degradation product of the latter. To elucidate this question and
in an attempt to identify the C-terminally extended CCKs, the present inventors
produced a disruption strain in which both KEX2, encoding the serine protease
responsible for the processing to CCK-8, and the carboxypeptidase encoded by KEX1
were mutated. Following transformation of proCCK into this kex2 kex1 strain (UY22) and
subsequent analysis of the secreted peptides, the inventors found only the peak
corresponding to Tyr45-Lys61. The same pattern, with only the peak corresponding to
Tyr45-Lys61, was seen using single gene disruptions of KEX1 and KEX2 to express
proCCK (data not shown). Thus the Tyr45-Val60 must be a degradation product, in
accordance with CCK-22 arising from cleavage after Lys61. Additional fragments were
discovered by CCK expression in the kex2 kex1 strain corresponding to processing
leading to CCK-61 (not identified in mammals), CCK-58, C-terminally extended CCK-39
and C-terminally extended CCK-22 (Figure 12), whereas none of the peptides
corresponding to CCK-8 could be identified, in accordance with our previous work
showing that Kex2p is responsible for this processing (Rourke et al., 1997).

Kex2p is involved in the biosynthesis of CCK-22

Previous analysis of CCK peptides secreted from a kex2 strain as well as the results
obtained by mass spectrometry indicate that the cleavage at Lys61 releasing CCK-22
can occur without the involvement of Kex2p. However, the kex2 strain shows a
decrease in CCK-22 concentration. ProCCK was expressed both in the vacuole protease
deficient strain and the isogenic kex2 strain (BJ2168 and UY23), and the processed
intra- and extracellular fractions of CCK-22 from exponentially growing cells were
measured. Approximately 28% of the intracellular CCK content was processed after
Lys61 in BJ2168, whereas only 6% was processed within the kex2 strain. Analysis of
secreted CCK peptides showed that the media collected from BJ2168 expressing proCCK
contained approximately 20% CCK-22, whereas from the kex2 mutant the amount was
reduced to 5%. These results indicate that Kex2p is involved in the processing leading
to CCK-22. However, there are other proteases that can perform the cleavage at Lys61.

In vitro assay of Lys61 cleavage

To investigate the nature of the protease(s), in addition to Kex2p, that are able to
perform the endoproteolytic cleavage after the single Lys61 residue of proCCK, an in
vitro assay was established using crude preparations of S. cerevisiae and
synthetic CCK-33 as substrate.

Using extract from the vacuole protease deficient strain, BJ2168, there was
extensive N-terminal degradation, and the recovery of measurable CCK was less than
10% of the control without yeast extract. Because the assay depends on the intact
N-terminus of CCK-22 for the antibody to bind, the inventors created a strain in which
some of the known S. cerevisiae aminopeptidases were deleted. The Y14953 strain
(ape1) was used as parental strain, in which the APE2 and APE3 genes were also
deleted. Using this UY123 strain to prepare the cell extract there was a 2-3 fold better
recovery of immunoreactivity compared to the recovery seen with BJ2168.

Processing to CCK-22 depends on metal ions

The nature of the protease performing the cleavage of synthetic human CCK-33 to
CCK-22 was analysed by inclusion of a number of different inhibitors with the extract
from UY123. The results showed that only the addition of a metal chelating agent
inhibited proteolysis of CCK-33 to CCK-22 (Fig. 4 A).

The metal dependency of the protease was tested in vitro after the activity had
initially been inhibited by addition of 1 mM EDTA. Reconstitution of the activity leading
to maturation of CCK-22 was tested by addition of different divalent cations in 0.2 mM
surplus. Addition of Zn2+, Co2+ and Mn2+ could re-establish the protease activity,
whereas Ca2+, Cu2+ or Mg2+ had no effect (Fig. 4 B), in accordance with the properties
of known metalloproteases, which are only activated by Zn2+, Co2+ and Mn2+.
Reactivation using increasing Zn2+ concentrations showed a biphasic pattern, with Zn2+
acting inhibitory at concentrations above 5 mM (data not shown).

The time course of CCK cleavage by Zn2+ and Mn2+ reactivated metalloproteases was
analysed using cell extract from UY123, after initial inhibition with 1 mM EDTA.
Reactivation was performed by addition of 1.2 mM Zn2+ or Mn2+ followed by incubation
for 30, 60 and 120 min. In this assay and the following in vitro protease assays the
inventors used the N-terminally acetylated CCK-33-Gly (Ac-CCK-33-Gly) as substrate,
which resulted in much slower non-specific degradation. Measurement of the CCK-22
immunoreactivity before and after tryptic cleavage using Ab 89009 showed no
difference in the activation potency between Zn2+ and Mn2+ at 30 and 60 min; however,
after 120 min 10% more CCK-22 immunoreactivity was measured using Mn2+ as
activator compared to Zn2+ (Fig. 5). This increase in immunoreactivity is probably due
to an inhibition of degradation following addition of Mn2+ here as well as to the yeast
cell extracts used in Table 3.

Table 3. Metalloproteases in Saccharomyces cerevisiae. Search performed in the
Swiss-Prot Sequence Retrieval System (SRS), http://www.expasy.ch/. Protease assay
performed in two independent assays (A and B) using extracts from the metalloprotease
deficient strains. The amount of CCK-22 is measured with Ab 89009 and the total
amount of CCK is measured after tryptic cleavage with Ab 89009. Putative
metalloproteases are marked with *.




Name | Swiss-Prot acc# | ORF | CCK-22 [nM] A1 | CCK-22 [nM] B1 | Total CCK [nM] A2 | Total CCK [nM] B2 | Fraction CCK-22 A1/A2 | Fraction CCK-22 B1/B2
AAP1 | P37898 | YHR047c | ? | ? | ? | ? | ? | ?
AFG3 | P39925 | YER017c | ? | ? | ? | ? | 0.12 | 0.10
APE1 | P14904 | YKL103c | 4.4 | ? | ? | ? | 0.13 | 0.13
APE2 | P32454 | YKL157w | | | | | |
APE3 | P37302 | YBR286w | | | | | |
DPP3 | Q08225 | YOL057W | ? | 4.0 | ? | ? | 0.14 | ?
LTA4 | Q10740 | YNL045W | ? | ? | ? | ? | 0.12 | ?
MIP1 | P35999 | YKL134c | ? | ? | ? | ? | ? | ?
PRD1 | P25375 | YCL057w | 2.4 | 2.8 | 28 | 26 | 0.09 | 0.11
QRI7* | P43122 | YDL104C | 2.8 | 2.9 | 23 | 24 | 0.12 | 0.12
RCA1 | P40341 | YMR089c | 4.2 | 3.5 | 31 | 27 | 0.14 | 0.13
 | Q06010 | YLR389C | 2.6 | 2.3 | 25 | 20 | 0.10 | 0.12
STE24 | P47154 | YJR117w | 3.4 | 2.8 | 19 | 17 | 0.18 | 0.16
YBS4* | P38244 | YBR074w | 2.6 | 2.7 | 27 | 29 | 0.10 | 0.09
YHR3* | P38821 | YHR113w | 2.5 | 2.4 | 26 | 26 | 0.10 | 0.09
YHT2* | P38836 | YHR132c | 3.8 | 3.3 | 34 | 32 | 0.11 | 0.10
YIK8* | P40483 | YIL108W | 5.7 | 5.2 | 43 | 39 | 0.13 | 0.13
YIN7* | P40462 | YIL137C | 3.5 | 2.8 | 23 | 20 | 0.15 | 0.14
YKI8* | P36132 | YKR038C | 2.9 | 2.1 | 23 | 20 | 0.13 | 0.11
YME1 | P32795 | YPR024W | 2.3 | 2.7 | 25 | 25 | 0.09 | 0.11
MAS2 | P11914 | YHR024C | ND. Lethal genes
MAS1 | P10507 | YLR163c | ND. Lethal genes
AXL1 | P40851 | YPR122w | 3.5 | 3.4 | 31 | 30 | 0.11 | 0.11
CYM1* | P32898 | YDR430C | 0.6 | 0.4 | 42 | 39 | 0.01 | 0.01
YOJ8* | Q12496 | YOL098C | 3.6 | 3.8 | 35 | 34 | 0.10 | 0.12

Extracellular yapsin activity

To investigate whether any protease activity is secreted or attached extracellularly to
the plasma membrane, the protease activity was assayed in media and with intact yeast
cells. No degradation of CCK-33 occurred after 1 h of incubation at 30°C using media
from exponentially growing UY123 cells, in accordance with earlier observations (Rourke
et al., 1997). During incubation with intact yeast cells, cleavage to expose the
N-terminus of CCK-22 could be measured (Fig. 6); however, this protease activity could
not be abolished by the inhibitors investigated (data not shown). Using intact cells
containing gene disruptions of YPS1, YPS2 and YPS3 (Table 1), the fraction of processed
CCK-22 decreased with deletion of each of the three aspartyl proteases compared to
wild type cells (Fig. 6). These data show that the three proteases all have extracellular
protease activity, which can cleave at Lys61 in proCCK. Preliminary results indicate that
gene disruption of YPS7 decreases extracellular Lys61 processing in amounts
comparable to the YPS1 deletion (unpublished results).

Expression of proCCK in metalloprotease deficient strains

Based on previously described metalloproteases in S. cerevisiae with endoproteolytic
activity (Adames et al., 1995; Schmidt et al., 2000), gene deletion strains of AXL1
(UY201) and STE24 (UY202) were initially prepared in BJ2168. ProCCK expression in
these strains showed that proteolysis after Lys61 was unchanged compared to wild type,
and it was decided to test the remaining metalloprotease deficient strains UY123,
UY203, UY204 and the metalloprotease deficient strains obtained through Euroscarf
(Table 1) for their ability to secrete CCK-22 (mitochondrial peptidases were not
included). The CCK-22 immunoreactivity did not change significantly among the
CCK-producing metalloprotease deficient strains (data not shown), and no protease
responsible for the processing of heterologously expressed proCCK to CCK-22 was
identified by this approach.

CYM1 encodes a protease that can release the free N-terminus of CCK-22

Cell extracts were prepared from each of the viable metalloprotease deficient strains
and tested in the in vitro protease assay to investigate whether any reduction in
proteolysis was measurable. In this assay 1 mM Mn2+ and 1 mM bestatin were included
prior to the addition of Ac-CCK-33-Gly, since it was found that the recovery was
80-90% compared to 30% without addition of these aminopeptidase inhibitors (data not
shown). Deletion of CYM1 almost abolished the protease activity, whereas none of the
other metalloprotease deficient strains showed a significant change in the biosynthesis
of CCK-22 (Table 3).
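
The comparison drawn from Table 3 can be reproduced with a short Python sketch. It uses only the assay A values of rows that are fully legible in Table 3 and compares the CYM1 deletion with the median of the other listed strains. Using the median as the reference is a choice made here for illustration and is not described in the specification.

```python
from statistics import median

# Assay A values as listed in Table 3: strain -> (CCK-22, total CCK).
assay_a = {
    "PRD1": (2.4, 28), "QRI7": (2.8, 23), "RCA1": (4.2, 31),
    "STE24": (3.4, 19), "YBS4": (2.6, 27), "YHR3": (2.5, 26),
    "YHT2": (3.8, 34), "YIK8": (5.7, 43), "YIN7": (3.5, 23),
    "YKI8": (2.9, 23), "YME1": (2.3, 25), "AXL1": (3.5, 31),
    "YOJ8": (3.6, 35), "CYM1": (0.6, 42),
}

# Fraction of CCK-22 per deletion strain (CCK-22 divided by total CCK).
fractions = {strain: cck22 / total for strain, (cck22, total) in assay_a.items()}

# Reference level: median fraction of all listed strains other than the CYM1 deletion.
reference = median(f for strain, f in fractions.items() if strain != "CYM1")
residual = fractions["CYM1"] / reference

lowest = min(fractions, key=fractions.get)
print(f"lowest fraction: {lowest} ({fractions[lowest]:.3f}); "
      f"relative to median of other strains: {residual:.0%}")
```

With these values the CYM1 deletion extract retains roughly one eighth of the median CCK-22 fraction of the other strains, consistent with the statement that the activity is almost abolished.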

Expression of CYM1 on a multicopy plasmid increases the fraction of matured CCK-22 in
vitro

To determine whether the amount of synthesized CCK-22 correlates with the amount of
Cym1p in vitro, Cym1p was expressed on a multicopy plasmid and the fraction of
synthesized CCK-22 was analysed over time. Cell extracts from BJ2168 transformed with
pRS425 CYM1 and from the control transformed with the empty pRS425 vector were
used in the in vitro protease assay with 1 mM Mn2+, in which the reactions were
terminated after 15, 30, 45 and 60 min. The CCK-22 immunoreactivity was measured
with Ab 89009 and the remaining CCK-33 was measured with the same antibody after
tryptic cleavage (Fig. 7). Expression of CYM1 on a multicopy plasmid enhanced the rate
of CCK-22 production several fold. However, the inventors also observed an increased
degradation of CCK-33 and CCK-22 (Fig. 7 B). When the same experiment was
performed at pH 6.0 and pH 7.5, there was a dramatically increased degradation, and
after 30 min of incubation the CCK immunoreactivity was undetectable at pH 6.0 (data
not shown). These results show that the Lys-specific cleavage in CCK-22 maturation in
vitro is dependent on the amount of Cym1p.

Expression of proCCK in a cym1 mutant strain enhances CCK secretion

To elucidate the role of CYM1 in the biosynthesis of CCK-22 in vivo, gene deletions of
CYM1 were prepared in the vacuole protease deficient strain, BJ2168, and the isogenic
kex2 strain. Deletion of CYM1 resulted in an approximately 40% increase in the total
amount of proCCK within the cells (Fig. 8 A), accompanied by a similar decrease in
CCK-22 independent of KEX2 disruption (Fig. 8 C). The secreted amount of total
CCK in the cym1 strains also increased by more than 60% (Fig. 8 B), but unlike the
fractional decrease in intracellular CCK-22 there was an increase in the extracellular
fractions of CCK-22 compared to the vacuole protease deficient strain and the isogenic
kex2 strain (Fig. 8 D).

Expression of the CCK K->A mutant leads to intracellular CCK accumulation comparable
to the accumulation of wild type CCK in a cym1 strain

The observation that a gene disruption of CYM1 causes an increase in intracellular
concentrations of CCK (Fig. 8 A) raises the question of whether the proteolytic activity
of Cym1p leads to degradation of CCK-22 prior to translocation into the ER. Therefore,
the inventors examined the intracellular CCK content in strains expressing CCK in which
the maturation of CCK-22 has been eliminated by using the Lys61->Ala61 mutant.
Transformants of this CCK mutant in the vacuole protease deficient strain, BJ2168, and
the isogenic cym1 strain were analysed using Ab 7270 after trypsin and
carboxypeptidase B treatment, and there was an increase in the intracellular CCK
immunoreactivity for this construct compared to expression of wild type CCK (Fig. 9).
Mutant CCK (K->A) and wild type CCK transformants resulted in an increase in the
intracellular proCCK concentration when expressed in BJ2168 and the CYM1 disruption
strain, respectively. The increase in intracellular proCCK was not additive, showing that
the proteolytic activity of Cym1p leads to degradation of CCK-22 prior to translocation
into the ER.

Expression of proCCK in aspartyl protease deficient strains

The Lys61-specific cleavage of proCCK was analysed in null mutants of YPS1, -2 and -3,
where the intra- and extracellular amount of CCK-22 was measured from exponentially
growing cells of wild type yeast, BY4705, and the isogenic aspartyl protease deficient
strains UY13, -14 and -15 transformed with proCCK. Both intra- and extracellular CCK
immunoreactivity of BY4705 was lowered more than 10 fold compared to the vacuolar
protease deficient strain, BJ2168 (data not shown). The intracellular fraction of CCK-22
decreased significantly from approximately 28% in wild type cells to 17% in the yps1
strain, whereas no additional decrease could be measured by gene disruptions of YPS2
and YPS3 (Fig. 10 A). The extracellular fraction of CCK-22 did, however, show that
Yps1p, Yps2p and Yps3p are all involved in the biosynthesis of CCK-22 and that the
triple mutant reduced the fraction of CCK-22 to 2/3 compared to wild type yeast (Fig.
10 B).

CYM1 disruption leads to a two-fold increase in the total amount of secreted wild type
CCK as well as the CCK K->R mutant

To elucidate whether Cym1p cleaves C-terminally to a single Arg residue, the CCK
(K->R) mutant was expressed in the vacuolar protease deficient strain, BJ2168, and the
isogenic cym1 strain. The concentration of both intra- and extracellular CCK was
compared to wild type CCK expressed in these strains. The total amount of the mutated
CCK (K->R) was increased both intra- and extracellularly, comparable to wild type CCK
(Fig. 11). Both wild type CCK and the Lys61 -> Arg61 mutant showed more than a
two-fold increase in the measurable amount of extracellular CCK when expressed in the
cym1 strain (Fig. 11B).

Usage of Cym1p activity in the synthesis of peptides

Another aspect of the invention is to use the activity from Cym1p, either expressed
from its own promoter or from a strong constitutive promoter such as PGK1, ADH1 or
TPI1, or the inducible GAL1 promoter, to produce an increased amount of cytosolic
Cym1p activity. As previously mentioned, the synthesis of CCK-22 is significantly
increased when CYM1 is transcribed from its own promoter on a 2µ plasmid (Fig. 7).
Transcription can either be performed from a plasmid containing the promoter, CYM1
and a terminator, or by introducing the desired promoter into the genome by
heterologous recombination to substitute the endogenous promoter of CYM1.

The activity can be used intracellularly to generate peptides that do not require
post-translational modifications from the secretory pathway, such as disulfide bond
formation, N- and O-glycosylation or exoprotease activity.

The role of Cym1p cytosolic activity in intracellular peptide synthesis is shown in the
biosynthesis of CCK-22 in wild type cells compared to the isogenic strain with a CYM1
disruption, which shows a significant increase in the amount of CCK-22 (Fig. 8C).
Synthesis of the peptide of interest should be performed in such a way that
translocation into the endoplasmic reticulum (ER) is avoided. This can be performed
either by removal of the hydrophobic amino-terminal signal sequence from proteins
that enter the ER post-translationally, or by expression in a temperature sensitive
secretory mutant such as sec61, which abolishes translocation of secretory peptides
into the ER when the temperature is elevated to 37°C.

The propeptide or prepropeptide of interest will then be cytosolically located and a
potential substrate for Cym1p. Release of the peptide from its precursor will be carried
out by the Cym1p activity by introduction of the cleavage site seen from proCCK, which
results in the release of Gly-extended CCK-22 after endoproteolytic processing
C-terminal to Lys61 (Ser-Ile-Val-Lys61↓) (Fig. 13A). If the peptide of interest is GLP1,
synthesis can be performed as a fusion to a Cym1p cleavage site, which could be a part
of proCCK (Fig. 13B). The peptide of interest will then accumulate in the cytosol and
can be purified from sedimented cells after lysis.
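
The fusion strategy described above can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that the four-residue proCCK site Ser-Ile-Val-Lys (one-letter code `SIVK`) is sufficient for recognition and that cleavage occurs immediately C-terminal to the Lys. The leader and peptide strings are hypothetical placeholders rather than sequences from this specification, and the function names are invented for the example.

```python
from typing import Tuple

# Cleavage site taken from the proCCK example in the text (Ser-Ile-Val-Lys).
# Treating these four residues as sufficient for recognition is an assumption.
CLEAVAGE_SITE = "SIVK"


def build_fusion(leader: str, peptide: str, site: str = CLEAVAGE_SITE) -> str:
    """Join a leader, the cleavage site and the peptide of interest."""
    return leader + site + peptide


def cleave_after_site(precursor: str, site: str = CLEAVAGE_SITE) -> Tuple[str, str]:
    """Split the precursor immediately C-terminal to the first cleavage site."""
    start = precursor.find(site)
    if start == -1:
        raise ValueError("cleavage site not found in precursor")
    cut = start + len(site)
    return precursor[:cut], precursor[cut:]


if __name__ == "__main__":
    leader = "MAAAA"    # hypothetical placeholder leader
    peptide = "GGGWWW"  # hypothetical placeholder peptide of interest
    fusion = build_fusion(leader, peptide)
    n_fragment, released = cleave_after_site(fusion)
    print(fusion, n_fragment, released)
```

In this toy model the released fragment is exactly the peptide placed after the site, which mirrors the release of Gly-extended CCK-22 after processing at Lys61; real substrate recognition may depend on more sequence context than four residues.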
Discussion

The secreted polypeptides vary with the growth conditions, the fraction of CCK-22
increasing when the culture reaches stationary phase, while the intracellular processed
fraction remains unaltered under stress conditions. The increase in extracellular
cleavage to CCK-22 as the cells enter stationary phase could indicate that extracellular
endoproteases with the ability to process proCCK to CCK-22 are secreted or expressed
on the cell membrane. It is known that the aspartyl proteases, Yps1p and Yps2p,
exhibit cell surface activity (Komano et al., 1998). In addition, it has previously been
shown that heterologous peptide expression in a yps1 strain improved the recovery of
proteins and peptides like albumin, glucagon, GLP1, GLP2 and CART by inhibiting
proteolysis C-terminal to mono-basic residues (Egel-Mitani et al., 2000; Kerry-Williams
et al., 1998). Thus, recent studies (Egel-Mitani et al., 2000; Kerry-Williams et al.,
1998; and those of the present inventors) show the importance of collecting secreted
peptides during exponential growth in order to avoid additional extracellular processing.

ProCCK expressed in a vacuole protease deficient strain showed 30% intracellular
processing at Lys61 in proCCK. The fraction of extracellular Lys61-processing is,
however, decreased to 2/3 of that observed within intact yeast cells, which reveals an
intracellular degradation of CCK-22 prior to secretion. The increase in extracellular
proteolysis under limited nutrient resources is probably due to an activation or
upregulation in transcription of the extracellular proteases, as seen with the
upregulation of YPS1 transcription during stationary phase (Gasch et al., 2000). Part of
the cell surface activity can be assigned to the yapsins, Yps1p, Yps2p and Yps3p, but
some extracellular activity was sustained even in the triple mutant.

In the present study the inventors have shown that deletion of KEX2 causes a 5-fold
reduction in both the intracellular and extracellular Lys61-cleavage. Expression of
proCCK in the kex2 strain not only alters the cleavage at Lys61 in proCCK, it also changes
the intracellular retention time of CCK, as the intracellular concentration of CCK peptides
is reduced by more than 60%, while the extracellular CCK concentration is increased by
almost 60% compared to wild type yeast. Moreover, analysis of the secreted CCK
peptides from the kex1 kex2 double mutant and the kex2 mutant showed
disappearance of the Tyr45-Val60 degradation product. Thus, the removal of Lys61 by
Kex1p was abolished in a kex2 strain, indicating an enhanced secretion rate through the
trans-Golgi network. These results and the observations on the rapid secretion of
proCCK suggest that it may be the intracellular retention caused by Kex2p that leads to
an increased synthesis of CCK-22 in wild type yeast by Yps1p and probably to some
extent by Kex2p.

The type of protease responsible for the intracellular maturation of CCK-22 was
investigated in an in vitro protease assay using a crude extract of S. cerevisiae to
analyse the processing of synthetic human CCK-33 to CCK-22 in the presence of
different inhibitors. By not including detergents in the extraction of protease activity,
activity from Kex2p as well as the GPI-anchored yapsins was avoided (Azaryan et al.,
1993; Fuller et al., 1989; Komano et al., 1999). Of the inhibitors tested, the proteolysis
was only inhibited by EDTA and 1,10-ortho-phenanthroline, and the activity could be
restored by addition of the divalent cations Zn2+, Co2+ and Mn2+. This indicated that a
metalloprotease participates in the maturation of CCK-22.
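
The reasoning behind the inhibitor assay can be sketched in a few lines of Python. The sketch assumes a simple lookup from diagnostic inhibitors to protease classes: EDTA and 1,10-phenanthroline are the metal chelators named in the text, whereas PMSF, pepstatin A and E-64 are illustrative entries chosen for the example and are not stated in this specification to be the inhibitors actually tested. The function name is hypothetical.

```python
from typing import Dict, Set

# Diagnostic inhibitors per protease class. EDTA and 1,10-phenanthroline are the
# metal chelators named in the text; the other entries are illustrative
# assumptions and are not taken from this specification.
DIAGNOSTIC_INHIBITORS: Dict[str, Set[str]] = {
    "metalloprotease": {"EDTA", "1,10-phenanthroline"},
    "serine protease": {"PMSF"},
    "aspartyl protease": {"pepstatin A"},
    "cysteine protease": {"E-64"},
}


def infer_protease_classes(inhibited: Dict[str, bool]) -> Set[str]:
    """Return the classes for which every tested diagnostic inhibitor blocked activity."""
    classes = set()
    for protease_class, inhibitors in DIAGNOSTIC_INHIBITORS.items():
        tested = [name for name in inhibitors if name in inhibited]
        if tested and all(inhibited[name] for name in tested):
            classes.add(protease_class)
    return classes


if __name__ == "__main__":
    # Hypothetical assay read-out: True means the cleavage was inhibited.
    observed = {
        "EDTA": True,
        "1,10-phenanthroline": True,
        "PMSF": False,
        "pepstatin A": False,
        "E-64": False,
    }
    print(infer_protease_classes(observed))
```

A profile in which only the chelators inhibit points to a metalloprotease; restoration of activity by divalent cations, as reported above, is an additional line of evidence that this lookup does not model.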

None of the candidate metalloproteases contain an obvious signal peptide to direct the
protein into the ER. Therefore, the inventors investigated strains deficient in each of the
metalloproteases, with the exception of mitochondrial proteases. Expression of proCCK in
each of the strains resulted in unaltered maturation of CCK-22 similar to that seen in
wild type yeast. However, by using the in vitro protease assay the inventors identified
Cym1p as an endoprotease performing post-Lys cleavage of CCK-33. That Cym1p can
cleave at Lys61 in proCCK was verified by overexpression studies, showing a several-fold
increase in enzyme activity.

Intracellular synthesis of CCK-22 was decreased in a cym1 strain, accompanied by an
increased concentration of total proCCK. In contrast, the fraction of extracellular
CCK-22 was increased compared to wild type yeast, with a parallel increase in total
CCK. These findings are in accordance with a cytosolic location of the Cym1p activity,
like most insulin-degrading enzymes (Bai et al., 1996), and show that it acts on the
preproMfα1p-proCCK construct prior to translocation into the endoplasmic reticulum.
Thus, the pre-translocational degradation of proCCK is decreased by CYM1 disruption
and the total production increased.

All publications discussed above are incorporated herein in their entirety. 

It will be appreciated by persons skilled in the art that numerous variations and/or
modifications may be made to the invention as shown in the specific embodiments
without departing from the spirit or scope of the invention as broadly described. The
present embodiments are, therefore, to be considered in all respects as illustrative and
not restrictive.
Received PVS
20 SEP. 2002

Claims 

1. A method for producing a protein of interest in a host cell, wherein said host cell has
been genetically modified in order to express significantly reduced levels of a
metalloprotease which is at least 80% identical to SEQ ID NO: 4 as compared to a
parental cell, when cultured under identical conditions, comprising

a) introducing into the host cell a nucleic acid sequence encoding the protein of
interest;

b) cultivating the host cell of step (a) in a suitable growth medium for production of
the protein of interest; and

c) isolating the protein of interest.

2. The method according to claim 1, in which the host cell is a prokaryotic cell.

3. The method according to claim 1, in which the host cell is a eukaryotic cell. 

4. The method according to claim 3, in which the host cell is a non-filamentous fungal cell.

5. The method according to claim 3, in which the host cell is a filamentous fungal cell.

6. The method according to claim 4, in which the host cell is a strain of Saccharomyces.

7. The method according to claim 6, in which the host cell is Saccharomyces cerevisiae.

8. The method according to any of the preceding claims, wherein the host cell has been
genetically modified by a method selected from the group comprising gene knock-out,
gene disruption, random or site directed mutagenesis, introduction of dominant-negative
metalloproteases, RNA interference (RNAi) using dsRNA, catalytic nucleic acids (such as
ribozymes and DNAzymes), antisense nucleic acids or a combination thereof.

9. The method according to any of the preceding claims, wherein the level of expressed
metalloprotease in the host cell is reduced more than about 50%, such as more than about
80%, such as more than about 90%, or such as more than about 95%.

10. The method according to any of the preceding claims, wherein the host cell is
essentially free of any metalloprotease activity.

11. The method according to any of the preceding claims, in which the protein of interest
is a eukaryotic protein, selected from the group comprising insulin, growth hormone,
glucagon, somatostatin, interferon, PDGF, factor VII, factor VIII, urokinase, EPO, chymosin,
tissue plasminogen activator, or serum albumin.

12. The method according to any of the preceding claims, in which the protein of interest
is a protein of fungal origin, selected from the group comprising an amylolytic enzyme, an
alpha-amylase, a beta-amylase, a glycoamylase, an alpha-galactosidase, a cellulytic
enzyme, a lipolytic enzyme, a xylanolytic enzyme, a proteolytic enzyme, an
oxidoreductase, a peroxidase, a laccase, a pectinase, or a cutinase.

13. The method according to any of the preceding claims, in which the protein of interest
is a bacterial protein, selected from the group comprising an amylolytic enzyme, an
alpha-amylase, a beta-amylase, a glycoamylase, a beta-galactosidase, a cellulytic enzyme, a
lipolytic enzyme, a xylanolytic enzyme, a proteolytic enzyme, an oxidoreductase, a
peroxidase, a laccase, a pectinase, or a cutinase.

14. The method according to any of the preceding claims, in which the protein of interest
is a precursor, i.e. a zymogen, a hybrid protein, a protein obtained as a pro sequence or
pre-pro sequence, or in unmaturated form.

15. A host cell useful for the expression of a protein of interest, wherein said cell has been
genetically modified in order to express significantly reduced levels of a metalloprotease
which is at least 80% identical to SEQ ID NO: 4, as compared to a parental cell.

16. A host cell according to claim 15, wherein said cell is a prokaryotic cell.

17. A host cell according to claim 16, wherein said cell is a eukaryotic cell.

18. A host cell according to claim 17, wherein said cell is a non-filamentous fungal cell.

19. A host cell according to claim 18, wherein said cell is a filamentous fungal cell.

20. A host cell according to claim 19, wherein said host cell is a strain of Saccharomyces.

21. A host cell according to claim 20, wherein said host cell is Saccharomyces cerevisiae.

22. A host cell according to any of claims 15-21, wherein said cell has been genetically
modified by a method selected from the group comprising gene knock-out, gene
disruption, random or site directed mutagenesis, introduction of dominant-negative
metalloproteases, RNA interference (RNAi) using dsRNA, catalytic nucleic acids (such as
ribozymes and DNAzymes), antisense nucleic acids or a combination thereof.

23. A host cell according to any of claims 15-22, wherein the level of expressed
metalloprotease in said cell is reduced more than about 50%, such as more than about
80%, such as more than about 90%, such as more than about 95%.

24. A host cell according to any of claims 15-23, wherein said cell is essentially free of any
metalloprotease activity.

25. A host cell according to any of claims 15-24, wherein the protein of interest is a
eukaryotic protein, selected from the group comprising insulin, growth hormone, glucagon,
somatostatin, interferon, PDGF, factor VII, factor VIII, urokinase, EPO, chymosin, tissue
plasminogen activator, or serum albumin.

26. A host cell according to any of claims 15-25, wherein the protein of interest is a protein
of fungal origin, selected from the group comprising an amylolytic enzyme, an
alpha-amylase, a beta-amylase, a glycoamylase, an alpha-galactosidase, a cellulytic enzyme, a
lipolytic enzyme, a xylanolytic enzyme, a proteolytic enzyme, an oxidoreductase, a
peroxidase, a laccase, a pectinase, or a cutinase.

27. A host cell according to any of claims 15-26, wherein the protein of interest is a
bacterial protein, selected from the group comprising an amylolytic enzyme, an
alpha-amylase, a beta-amylase, a glycoamylase, a beta-galactosidase, a cellulytic enzyme, a
lipolytic enzyme, a xylanolytic enzyme, a proteolytic enzyme, an oxidoreductase, a
peroxidase, a laccase, a pectinase, or a cutinase.

28. A host cell according to any of claims 15-27, wherein the protein of interest is a
precursor, i.e. a zymogen, a hybrid protein, a protein obtained as a pro sequence or
pre-pro sequence, or in unmaturated form.
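
The "at least 80% identical to SEQ ID NO: 4" criterion used in claims 1 and 15 can be illustrated with a minimal Python sketch. It assumes that the two sequences have already been aligned by some external tool (gaps written as `-`) and that identity is counted over all alignment columns that are not gaps in both sequences; this denominator is one common convention chosen here for illustration and is not asserted to be the definition used in this specification. The function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the columns of a pre-computed pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue  # skip columns that are gaps in both sequences
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * identical / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 80.0) -> bool:
    """True when the aligned pair reaches the given percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Toy aligned strings, not taken from the sequence listing.
    print(percent_identity("ACDEFGHIKL", "ACDEFGHIKV"))
    print(meets_identity_threshold("ACDE-", "AC-EF"))
```

Because the result depends on the alignment and on how gapped columns are counted, any practical use would need to fix both choices before comparing a candidate metalloprotease with SEQ ID NO: 4.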
<220>
<221> MISC_FEATURE
<222> (5)..(5)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (6)..(6)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (9)..(9)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (10)..(10)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (11)..(11)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (12)..(12)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (14)..(14)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (15)..(15)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (16)..(16)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (17)..(17)
<223> X = any amino acid




<220>
<221> MISC_FEATURE
<222> (18)..(18)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (19)..(19)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (20)..(20)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (21)..(21)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (22)..(22)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (23)..(23)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (24)..(24)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (25)..(25)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (26)..(26)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (27)..(27)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (28)..(28)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (29)..(29)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (30)..(30)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (31)..(31)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (32)..(32)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (33)..(33)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (34)..(34)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (35)..(35)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (36)..(36)
<223> X = any amino acid or is absent

<220>
<221> MISC_FEATURE
<222> (37)..(37)
<223> X = any amino acid or is absent

<220>
<221> MISC_FEATURE
<222> (40)..(40)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (42)..(42)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (43)..(43)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (44)..(44)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (45)..(45)
<223> X = any amino acid



<400> 2 

Gly Xaa Xaa His Xaa Xaa Glu His Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa 
1 5 10 15 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 



Xaa Xaa Xaa Xaa Xaa Asn Ala Xaa Thr Xaa Xaa Xaa Xaa Thr 
35 40 45 



<210> 3 

<211> 46 

<212> PRT 

<213> Artificial Sequence 



<220>
<223> Pitrilysin consensus sequence

<220>
<221> MISC_FEATURE
<222> (2)..(2)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (3)..(3)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (5)..(5)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (6)..(6)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (9)..(9)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (10)..(10)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (11)..(11)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (12)..(12)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (14)..(14)
<223> X = S or T

<220>
<221> MISC_FEATURE
<222> (15)..(15)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (19)..(19)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (20)..(20)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (21)..(21)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (22)..(22)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (23)..(23)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (24)..(24)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (25)..(25)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (26)..(26)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (27)..(27)
<223> X = any amino acid or is absent

<220>
<221> MISC_FEATURE
<222> (28)..(28)
<223> X = any amino acid or is absent

<220>
<221> MISC_FEATURE
<222> (30)..(30)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (31)..(31)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (32)..(32)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (33)..(33)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (34)..(34)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (35)..(35)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (36)..(36)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (37)..(37)
<223> X = any amino acid or is absent

<220>
<221> MISC_FEATURE
<222> (40)..(40)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (42)..(42)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (43)..(43)
<223> X = any amino acid

<220>
<221> MISC_FEATURE
<222> (44)..(44)
<223> X = D or E

<220>
<221> MISC_FEATURE
<222> (45)..(45)
<223> X = any amino acid



<400> 3

Gly Xaa Xaa His Xaa Xaa Glu His Xaa Xaa Xaa Xaa Gly Xaa Xaa Lys
1 5 10 15

Tyr Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa
20 25 30

Xaa Xaa Xaa Xaa Xaa Asn Ala Xaa Thr Xaa Xaa Xaa Xaa Thr
35 40 45
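
The pitrilysin consensus of SEQ ID NO: 3 can be expressed as a regular expression, shown here as a minimal Python sketch. It assumes one-letter amino acid code, treats positions annotated "or is absent" (27, 28 and 37) as optional, and uses the S/T and D/E alternatives given for positions 14 and 44. The pattern and function names are hypothetical, and the test fragment in the usage section is residues 81-124 of SEQ ID NO: 4 as listed in this document, converted to one-letter code.

```python
import re
from typing import Optional, Tuple

# Regular-expression form of the pitrilysin consensus (SEQ ID NO: 3).
PITRILYSIN_CONSENSUS = re.compile(
    r"G..H..EH....G[ST].KYP"  # positions 1-18
    r".{8,10}"                # positions 19-28 (27 and 28 may be absent)
    r"L"                      # position 29
    r".{7,8}"                 # positions 30-37 (37 may be absent)
    r"NA.T..[DE].T"           # positions 38-46
)


def find_consensus(sequence: str) -> Optional[Tuple[int, int]]:
    """Return the 1-based (start, end) of the first consensus match, or None."""
    match = PITRILYSIN_CONSENSUS.search(sequence)
    if match is None:
        return None
    return match.start() + 1, match.end()


if __name__ == "__main__":
    # Residues 81-124 of SEQ ID NO: 4 in one-letter code.
    fragment = "GVPHILEHTTLCGSVKYPVRDPFFKMLNKSLANFMNAMTGPDYT"
    print(find_consensus(fragment))
```

A regular expression only checks the fixed positions and the allowed spacing; it does not replace an alignment-based comparison when assessing overall similarity to a pitrilysin family member.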


<210> 4
<211> 989
<212> PRT
<213> Saccharomyces cerevisiae

<400> 4

Met Leu Arg Phe Gln Arg Phe Ala Ser Ser Tyr Ala Gln
1 5 10 15



Val Arg Lys Tyr Pro Val Gly Gly Ile Phe His Gly Tyr Glu Val Arg
20 25 30

Arg Ile Leu Pro Val Pro Glu Leu Arg Leu Thr Ala Val Asp Leu Val
35 40 45

His Ser Gln Thr Gly Ala Glu His Leu His Ile Asp Arg Asp Asp Lys
50 55 60

Asn Asn Val Phe Ser Ile Ala Phe Lys Thr Asn Pro Pro Asp Ser Thr
65 70 75 80

Gly Val Pro His Ile Leu Glu His Thr Thr Leu Cys Gly Ser Val Lys
85 90 95

Tyr Pro Val Arg Asp Pro Phe Phe Lys Met Leu Asn Lys Ser Leu Ala
100 105 110

Asn Phe Met Asn Ala Met Thr Gly Pro Asp Tyr Thr Phe Phe Pro Phe
115 120 125

Ser Thr Thr Asn Pro Gln Asp Phe Ala Asn Leu Arg Gly Val Tyr Leu
130 135 140

Asp Ser Thr Leu Asn Pro Leu Leu Lys Gln Glu Asp Phe Asp Gln Glu
145 150 155 160

Gly Trp Arg Leu Glu His Lys Asn Ile Thr Asp Pro Glu Ser Asn Ile
165 170 175

Val Phe Lys Gly Val Val Tyr Asn Glu Met Lys Gly Gln Ile Ser Asn
180 185 190

Ala Asn Tyr Tyr Phe Trp Ser Lys Phe Gln Gln Ser Ile Tyr Pro Ser
195 200 205

Leu Asn Asn Ser Gly Gly Asp Pro Met Lys Ile Thr Asp Leu Arg Tyr
210 215 220

Gly Asp Leu Leu Asp Phe His His Lys Asn Tyr His Pro Ser Asn Ala
225 230 235 240

Lys Thr Phe Thr Tyr Gly Asn Leu Pro Leu Val Asp Thr Leu Lys Gln
245 250 255

Leu Asn Glu Gln Phe Ser Gly Tyr Gly Lys Arg Ala Arg Lys Asp Lys
260 265 270

Leu Leu Met Pro Ile Asp Leu Lys Lys Asp Ile Asp Val Lys Leu Leu
275 280 285

Gly Gln Ile Asp Thr Met Leu Pro Pro Glu Lys Gln Thr Lys Ala Ser
290 295 300

Met Thr Trp Ile Cys Gly Ala Pro Gln Asp Thr Tyr Asp Thr Phe Leu
305 310 315 320

Leu Lys Val Leu Gly Asn Leu Leu Met Asp Gly His Ser Ser Val Met
325 330 335

Tyr Gln Lys Leu Ile Glu Ser Gly Ile Gly Leu Glu Phe Ser Val Asn
340 345 350

Ser Gly Val Glu Pro Thr Thr Ala Val Asn Leu Leu Thr Val Gly Ile
355 360 365

Gln Gly Val Ser Asp Ile Glu Ile Phe Lys Asp Thr Val Asn Asn Ile
370 375 380

Phe Gln Asn Leu Leu Glu Thr Glu His Pro Phe Asp Arg Lys Arg Ile
385 390 395 400

Asp Ala Ile Ile Glu Gln Leu Glu Leu Ser Lys Lys Asp Gln Lys Ala
405 410 415

Asp Phe Gly Leu Gln Leu Leu Tyr Ser Ile Leu Pro Gly Trp Thr Asn
420 425 430

Lys Ile Asp Pro Phe Glu Ser Leu Leu Phe Glu Asp Val Leu Gln Arg
435 440 445

Phe Arg Gly Asp Leu Glu Thr Lys Gly Asp Thr Leu Phe Gln Asp Leu
450 455 460

Ile Arg Lys Tyr Ile Val His Lys Pro Cys Phe Thr Phe Ser Ile Gln
465 470 475 480

Gly Ser Glu Glu Phe Ser Lys Ser Leu Asp Asp Glu Glu Gln Thr Arg
485 490 495

Leu Arg Glu Lys Ile Thr Ala Leu Asp Glu Gln Asp Lys Lys Asn Ile
500 505 510

Phe Lys Arg Gly Ile Leu Leu Gln Glu Lys Gln Asn Glu Lys Glu Asp
515 520 525

Leu Ser Cys Leu Pro Thr Leu Gln Ile Lys Asp Ile Pro Arg Ala Gly
530 535 540

Asp Lys Tyr Ser Ile Glu Gln Lys Asn Asn Thr Met Ser Arg Ile Thr
545 550 555 560

Asp Thr Asn Gly Ile Thr Tyr Val Arg Gly Lys Arg Leu Leu Asn Asp
565 570 575

Ile Ile Pro Phe Glu Leu Phe Pro Tyr Leu Pro Leu Phe Ala Glu Ser
580 585 590

Leu Thr Asn Leu Gly Thr Thr Thr Glu Ser Phe Ser Glu Ile Glu Asp
595 600 605

Gln Ile Lys Leu His Thr Gly Gly Ile Ser Thr His Val Glu Val Thr
610 615 620

Ser Asp Pro Asn Thr Thr Glu Pro Arg Leu Ile Phe Gly Phe Asp Gly
625 630 635 640

Trp Ser Leu Asn Ser Lys Thr Asp His Ile Phe Glu Phe Trp Ser Lys
645 650 655

Ile Leu Leu Glu Thr Asp Phe His Lys Asn Ser Asp Lys Leu Lys Val
660 665 670

Leu Ile Arg Leu Leu Ala Ser Ser Asn Thr Ser Ser Val Ala Asp Ala
675 680 685

Gly His Ala Phe Ala Arg Gly Tyr Ser Ala Ala His Tyr Arg Ser Ser
690 695 700

Gly Ala Ile Asn Glu Thr Leu Asn Gly Ile Glu Gln Leu Gln Phe Ile
705 710 715 720

Asn Arg Leu His Ser Leu Leu Asp Asn Glu Glu Thr Phe Gln Arg Glu
725 730 735

Val Val Asp Lys Leu Thr Glu Leu Gln Lys Tyr Ile Val Asp Thr Asn
740 745 750

Asn Met Asn Phe Phe Ile Thr Ser Asp Ser Asp Val Gln Ala Lys Thr
755 760 765

Val Glu Ser Gln Ile Ser Lys Phe Met Glu Arg Leu Pro His Gly Ser
770 775 780

Cys Leu Pro Asn Gly Pro Lys Thr Ser Asp Tyr Pro Leu Ile Gly Ser
785 790 795 800

Lys Cys Lys His Thr Leu Ile Lys Phe Pro Phe Gln Val His Tyr Thr
805 810 815

Ser Gln Ala Leu Leu Gly Val Pro Tyr Thr His Lys Asp Gly Ser Ala
820 825 830

Leu Gln Val Met Ser Asn Met Leu Thr Phe Lys His Leu His Arg Glu
835 840 845

Val Arg Glu Lys Gly Gly Ala Tyr Gly Gly Gly Ala Ser Tyr Ser Ala
850 855 860

Leu Ala Gly Ile Phe Ser Phe Tyr Ser Tyr Arg Asp Pro Gln Pro Leu
865 870 875 880

Lys Ser Leu Glu Thr Phe Lys Asn Ser Gly Arg Tyr Ile Leu Asn Asp
885 890 895

Ala Lys Trp Gly Val Thr Asp Leu Asp Glu Ala Lys Leu Thr Ile Phe
900 905 910

Gln Gln Val Asp Ala Pro Lys Ser Pro Lys Gly Glu Gly Val Thr Tyr
915 920 925

Phe Met Ser Gly Val Thr Asp Asp Met Lys Gln Ala Arg Arg Glu Gln
930 935 940

Leu Leu Asp Val Ser Leu Leu Asp Val His Arg Val Ala Glu Lys Tyr
945 950 955 960

Leu Leu Asn Lys Glu Gly Val Ser Thr Val Ile Gly Pro Gly Ile Glu
965 970 975

Gly Lys Thr Val Ser Pro Asn Trp Glu Val Lys Glu Leu
980 985



<210> 5 

<211> 882 

<212> PRT 

<213> Schizosaccharomyces pombe 

<400> 5 

Met Asn Tyr Ala Lys Leu Ser Ile Ala Phe Ser Lys Lys Thr Ile Lys
1 5 10 15

Thr His Asn Cys Arg Leu Phe Gln Arg Trp Leu His Val Gly Asp Lys
20 25 30

Val His Asp Phe Arg Val Val Asp Thr Lys Lys Val Pro Glu Leu Gln
35 40 45

Leu Asn Tyr Thr Arg Leu Lys His Glu Pro Thr Asn Ala Asp Met Ile
50 55 60

His Leu Asp Arg Glu Asp Pro Asn Ser Val Phe Ser Ile Gly Phe Gln
65 70 75 80

Thr Pro Ala Glu Asn Asp Glu Gly Ile Pro His Ile Leu Glu His Thr
85 90 95

Thr Leu Cys Gly Ser Asn Lys Tyr Pro Val Arg Asp Pro Phe Phe Lys
100 105 110

Met Leu Asn Arg Ser Leu Ala Thr Phe Met Asn Ala Phe Thr Ala Ser
115 120 125

Asp Phe Thr Phe Tyr Pro Phe Ala Thr Val Asn Thr Thr Asp Tyr Lys
130 135 140

Asn Leu Arg Asp Val Tyr Leu Asp Ala Thr Leu Phe Pro Lys Leu Arg
145 150 155 160

Lys Leu Asp Phe Leu Gln Glu Gly Trp Arg Phe Glu His Ala Asp Val
165 170 175

Asn Asp Lys Lys Ser Pro Ile Ile Phe Asn Gly Val Val Tyr Asn Glu
180 185 190

Met Lys Gly Gln Val Ser Asp Ser Ser Tyr Ile Phe Tyr Met Leu Phe
195 200 205

Gln Gln His Leu Phe Gln Gly Thr Ala Tyr Gly Phe Asn Ser Gly Gly
210 215 220

Asp Pro Leu Ala Ile Pro Asp Leu Lys Tyr Glu Glu Leu Val Lys Phe
225 230 235 240

His Arg Ser His Tyr His Pro Ser Asn Ala Lys Ile Leu Ser Tyr Gly
245 250 255

Ser Phe Pro Leu Glu Asp Asn Leu Ser Ala Leu Ser Glu Thr Phe Arg
260 265 270

Pro Phe Ser Lys Arg Glu Leu Asn Leu Pro Asn Thr Phe Leu Lys Glu
275 280 285

Phe Asp Gln Glu Lys Arg Val Val Glu Tyr Gly Pro Leu Asp Pro Val
290 295 300

Met Ala Pro Gly Arg Gln Val Lys Thr Ser Ile Ser Phe Leu Ala Asn
305 310 315 320

Asp Thr Ser Asn Val Tyr Glu Thr Phe Ala Leu Lys Val Leu Ser Lys
325 330 335

Leu Cys Phe Asp Gly Phe Ser Ser Pro Phe Tyr Lys Ala Leu Ile Glu
340 345 350

Ser Gly Leu Gly Thr Asp Phe Ala Pro Asn Ser Gly Tyr Asp Ser Thr
355 360 365

Thr Lys Arg Gly Ile Phe Ser Val Gly Leu Glu Gly Ala Ser Glu Glu
370 375 380

Ser Leu Ala Lys Ile Glu Asn Leu Val Tyr Ser Ile Phe Asn Asp Leu
385 390 395 400

Ala Leu Lys Gly Phe Glu Asn Glu Lys Leu Glu Ala Ile Leu His Gln
405 410 415

Met Glu Ile Ser Leu Lys His Lys Ser Ala His Phe Gly Ile Gly Leu
420 425 430

Ala Gln Ser Leu Pro Phe Asn Trp Phe Asn Gly Ala Asp Pro Ala Asp
435 440 445

Trp Leu Ser Phe Asn Lys Gln Ile Glu Trp Leu Lys Gln Lys Asn Ser
450 455 460

Asp Gly Lys Leu Phe Gln Lys Leu Ile Lys Lys Tyr Ile Leu Glu Asn
465 470 475 480

Lys Ser Arg Phe Val Phe Thr Met Leu Pro Ser Ser Thr Phe Pro Gln
485 490 495

Arg Leu Gln Glu Ala Glu Ala Lys Lys Leu Gln Glu Arg Thr Ser Lys
500 505 510

Leu Thr Asp Glu Asp Ile Ala Glu Ile Glu Lys Thr Ser Val Lys Leu
515 520 525

Leu Glu Ala Gln Ser Thr Pro Ala Asp Thr Ser Cys Leu Pro Thr Leu
530 535 540

Ser Val Ser Asp Ile Pro Glu Thr Ile Asp Glu Thr Lys Leu Lys Phe
545 550 555 560

Leu Asp Ile Ala Gly Met Lys Ala Gln Trp Tyr Asp Leu Ala Ala Gly
565 570 575

Leu Thr Tyr Ile Arg Leu Leu Leu Pro Leu Lys Asn Phe Pro Glu Ser
580 585 590

Leu Ile Pro Tyr Leu Pro Val Tyr Cys Asp Ala Cys Leu Asn Leu Gly
595 600 605

Thr His Ser Glu Ser Ile Gly Asp Leu Glu His Gln Ile Arg Arg Tyr
610 615 620

Thr Gly Gly Ile Ser Ile Ser Pro Ser Ala Val Thr Asn Asn Ser Asp
625 630 635 640

Val Ser Lys Tyr Glu Leu Gly Ile Ala Ile Ser Gly Tyr Ala Leu Asp
645 650 655

Lys Asn Val Gly Lys Leu Val Glu Leu Ile Asn Lys Ala Phe Trp Asn
660 665 670

Thr Asn Leu Ser Asn Thr Asp Lys Leu Ala Ile Met Leu Lys Thr Ser
675 680 685

Val Ser Gly Ile Thr Asp Gly Ile Ala Glu Lys Gly His Ser Phe Ala
690 695 700

Lys Val Ser Ser Ala Ser Gly Leu Thr Glu Lys Thr Ser Ile Thr Glu
705 710 715 720

Gln Leu Gly Gly Leu Thr Gln Val Lys Leu Leu Ser Gln Leu Ser Arg
725 730 735

Glu Glu Ser Phe Gly Pro Leu Val Glu Lys Leu Thr Ala Ile Arg Glu
740 745 750



Ile Leu Arg Gly Thr Ser Gly Phe Lys Ala Ala Ile Asn Ala Ser Pro
755 760 765

Thr Gln His Glu Val Val Glu Lys Ala Leu Gln Lys Phe Met Lys Ser
770 775 780

Arg Gly Val Asn Gln Gln Thr Gln Thr Lys Ser Thr Ser Lys Glu Arg
785 790 795 800

Asn Gly Ile Asn Ser Ile Lys Thr Tyr His Glu Leu Pro Phe Gln Thr
805 810 815

Tyr Phe Ala Ala Lys Ser Cys Leu Gly Val Pro Tyr Thr His Pro Asp
820 825 830

Gly Ala Pro Leu Gln Ile Leu Ser Ser Leu Leu Thr His Lys Tyr Leu
835 840 845

His Gly Glu Ile Arg Glu Lys Gly Gly Ala Tyr Gly Ala Gly Leu Ser
850 855 860

Tyr Ser Gly Ile Asp Gly Val Leu Ser Phe Phe Thr Tyr Arg Asp Ser
865 870 875 880

Asp Pro



<210> 6 
<211> 973 
<212> PRT 

<213> Clostridium perfringens 
<400> 6

Met Asn Phe Lys Glu Asn Asn Ile Tyr Ser Gly Phe Lys Leu Leu Asn
1 5 10 15

Ile Glu Asn Leu Asn Glu Ile Gly Gly Val Gly Leu Arg Phe Glu His
20 25 30

Glu Lys Thr Lys Ala Lys Leu Ile Lys Ile Leu Ser Glu Asp Asp Asn
35 40 45

Lys Cys Phe Ala Ile Gly Phe Arg Thr Pro Pro Glu Asn Ser Thr Gly
50 55 60

Val Pro His Ile Leu Glu His Ser Val Leu Cys Gly Ser Arg Lys Phe
65 70 75 80

Asn Thr Lys Glu Pro Phe Val Glu Leu Leu Lys Gly Ser Leu Asn Thr
85 90 95

Phe Leu Asn Ala Met Thr Tyr Pro Asp Lys Thr Ile Tyr Pro Val Ala
100 105 110

Ser Arg Asn Glu Lys Asp Phe Met Asn Leu Met Asp Val Tyr Leu Asp
115 120 125

Ala Val Leu Tyr Pro Asn Ile Tyr Lys His Lys Glu Ile Phe Met Gln
130 135 140

Glu Gly Trp His Tyr Tyr Ile Glu Asn Lys Glu Asp Glu Leu Lys Tyr
145 150 155 160

Asn Gly Val Val Tyr Asn Glu Met Lys Gly Ala Tyr Ser Ser Pro Asp
165 170 175

Ser Ile Leu Tyr Arg Lys Ile Pro Gln Thr Ile Tyr Pro Asp Thr Cys
180 185 190

Tyr Ala Leu Ser Ser Gly Gly Asp Pro Asp Glu Ile Pro Asn Leu Thr
195 200 205

Tyr Glu Glu Phe Val Glu Phe His Lys Lys Tyr Tyr His Pro Ser Asn
210 215 220

Ser Tyr Ile Phe Leu Tyr Gly Asn Gly Asp Thr Glu Lys Glu Leu Glu
225 230 235 240

Phe Ile Asn Glu Glu Tyr Leu Lys Asn Phe Glu Tyr Lys Glu Ile Asp
245 250 255

Ser Glu Ile Lys Glu Gln Lys Ser Phe Glu Ser Met Lys Glu Glu Ser
260 265 270

Phe Thr Tyr Gly Ile Ala Glu Ser Glu Asp Leu Asn His Lys Ser Tyr
275 280 285

Tyr Ser Leu Asn Phe Val Ile Gly Asp Ala Thr Asp Gly Glu Lys Gly
290 295 300

Leu Ala Phe Asp Val Leu Ala Tyr Leu Leu Thr Arg Ser Thr Ala Ala
305 310 315 320

Pro Leu Lys Lys Ala Leu Ile Asp Ala Gly Ile Gly Lys Ala Val Ser
325 330 335

Gly Asp Phe Asp Asn Ser Thr Lys Gln Ser Ala Phe Thr Val Leu Val
340 345 350

Lys Asn Ala Glu Leu Asn Lys Glu Glu Glu Phe Lys Lys Val Val Met
355 360 365

Asp Thr Leu Lys Asp Leu Val Glu Asn Gly Ile Asp Lys Glu Leu Ile
370 375 380

Glu Ala Ser Ile Asn Arg Val Glu Phe Glu Leu Arg Glu Gly Asp Tyr
385 390 395 400

Gly Ser Tyr Pro Asn Gly Leu Ile Tyr Tyr Leu Lys Val Met Asp Ser
405 410 415

Trp Leu Tyr Asp Gly Asp Pro Tyr Val His Leu Glu Tyr Glu Lys Asn
420 425 430

Leu Glu Lys Ile Lys Ser Ala Leu Thr Ser Asn Tyr Phe Glu Asp Leu
435 440 445

Ile Glu Arg Tyr Met Ile Asn Asn Thr His Ser Ser Leu Val Ser Leu
450 455 460

His Pro Glu Lys Gly Ile Asn Glu Lys Lys Ser Ala Glu Leu Lys Lys
465 470 475 480

Lys Leu Glu Glu Ile Lys Asn Ser Phe Asp Glu Lys Thr Leu Asn Glu
485 490 495

Ile Ile Asp Asn Cys Lys Lys Leu Lys Glu Arg Gln Ser Thr Pro Asp
500 505 510

Lys Lys Glu Asp Leu Glu Ser Ile Pro Met Leu Ser Leu Glu Asp Ile
515 520 525

Asp Lys Glu Ala Thr Lys Ile Pro Thr Glu Glu Lys Glu Ile Asp Gly
530 535 540
Ile Thr Thr Leu His His Asp Phe His Thr Asn Lys Ile Asp Tyr Val
545 550 555 560

Asn Phe Phe Phe Asn Thr Asn Ser Val Pro Glu Asp Leu Ile Pro Tyr
565 570 575

Val Gly Leu Leu Cys Asp Ile Leu Gly Lys Cys Gly Thr Glu Asn Tyr
580 585 590

Asp Tyr Ser Lys Leu Ser Asn Ala Ile Asn Ile Ser Thr Gly Gly Ile
595 600 605

Ser Phe Gly Ala Ile Thr Phe Ala Asn Leu Lys Lys Asn Asn Glu Phe
610 615 620

Arg Pro Tyr Leu Glu Ile Ser Tyr Lys Ala Leu Ser Ser Lys Thr Asn
625 630 635 640

Lys Ala Ile Glu Leu Val Asp Glu Ile Val Asn His Thr Asp Leu Asp
645 650 655

Asp Met Asp Arg Ile Met Gln Ile Ile Arg Glu Lys Arg Ala Arg Leu
660 665 670

Glu Gly Ala Ile Phe Asp Ser Gly His Arg Ile Ala Met Lys Lys Val
675 680 685

Leu Ser Tyr Ser Thr Asn Arg Gly Ala Tyr Asp Glu Lys Ile Ser Gly
690 695 700

Leu Asp Tyr Tyr Asp Phe Leu Val Asn Ile Glu Lys Glu Asp Lys Lys
705 710 715 720

Ser Thr Ile Ser Asp Ser Leu Lys Lys Val Arg Asp Leu Ile Phe Asn
725 730 735

Lys Gly Asn Met Leu Ile Ser Tyr Ser Gly Lys Glu Glu Glu Tyr Glu
740 745 750

Asn Phe Lys Glu Lys Val Lys Tyr Leu Ile Ser Lys Thr Asn Asn Asn
755 760 765



Asp Phe Glu Lys Glu Glu Tyr Asn Phe Glu Leu Gly Lys Lys Asn Glu 
770 775 780 




Gly Leu Leu Thr Gln Gly Asn Val Gln Tyr Val Ala Lys Gly Gly Asn
785 790 795 800

Tyr Lys Thr His Gly Tyr Lys Tyr Ser Gly Ala Leu Ser Leu Leu Glu
805 810 815

Ser Ile Leu Gly Phe Asp Tyr Leu Trp Asn Ala Val Arg Val Lys Gly
820 825 830

Gly Ala Tyr Gly Val Phe Ser Asn Phe Arg Arg Asp Gly Gly Ala Tyr
835 840 845

Ile Val Ser Tyr Arg Asp Pro Asn Ile Lys Ser Thr Leu Glu Ala Tyr
850 855 860

Asp Asn Ile Pro Lys Tyr Leu Asn Asp Phe Glu Ala Asp Glu Arg Glu
865 870 875 880

Met Thr Lys Tyr Ile Ile Gly Thr Ile Arg Lys Tyr Asp Gln Pro Ile
885 890 895

Ser Asn Gly Ile Lys Gly Asp Ile Ala Val Ser Tyr Tyr Leu Ser Asn
900 905 910

Phe Thr Tyr Glu Asp Leu Gln Lys Glu Arg Glu Glu Ile Ile Asn Ala
915 920 925

Asp Val Glu Lys Ile Lys Ser Phe Ala Pro Met Ile Lys Asp Leu Met
930 935 940

Lys Glu Asp Tyr Ile Cys Val Leu Gly Asn Glu Glu Lys Ile Lys Glu
945 950 955 960

Asn Lys Asp Leu Phe Asn Asn Ile Lys Ser Val Ile Lys
965 970



<210> 7 

<211> 971 

<212> PRT 

<213> Borrelia burgdorferi 

<400> 7 



Met Lys Lys Lys Lys Ile Phe Lys Leu Ile Ser Lys Thr Tyr Leu Glu
1 5 10 15 




Glu His Asp Ala Glu Gly Tyr Tyr Phe Lys His Glu Ser Gly Leu Glu 
20 25 30 



Val Phe His Leu Lys Ser Asp Ser Phe Lys Glu Asn Ala Phe Cys Ile
35 40 45

Ala Phe Lys Thr Ile Pro Ser Asn Asn Thr Gly Val Ala His Val Leu
50 55 60

Glu His Thr Ile Phe Cys Gly Ser Ser Lys Tyr Lys Ile Lys Asp Pro
65 70 75 80

Phe Leu Tyr Leu Leu Lys Gly Ser Leu Asn Thr Phe Leu Asn Ala Met
85 90 95

Thr Phe Pro Asp Lys Thr Ile Tyr Pro Ala Ala Ser Thr Ile Glu Lys
100 105 110

Asp Tyr Phe Asn Leu Phe Asn Ile Tyr Ala Asp Ser Ile Phe Asn Pro
115 120 125

Leu Leu Lys Lys Glu Ser Phe Met Gln Glu Gly Tyr Asn Ile Asn Pro
130 135 140

Lys Asp Phe Lys Val Ser Gly Ile Val Phe Asn Glu Met Lys Gly Ser
145 150 155 160

Tyr Ser Asn Lys Asn Ser Leu Ile Asn Glu Ile Val Ser Ser Ser Leu
165 170 175

Phe Glu Glu Gly Ala Tyr Lys Tyr Asp Ser Gly Gly Ile Pro Thr Asn
180 185 190

Ile Ile Asp Leu Thr Tyr Glu Ser Phe Leu Asp Phe Tyr Lys Lys Tyr
195 200 205

Tyr Thr Leu Glu Asn Cys Lys Ile Phe Leu Cys Gly Asn Thr Gln Thr
210 215 220

Glu Lys Asn Leu Asn Phe Ile Glu Lys Tyr Ile Ile Arg Pro Tyr Lys
225 230 235 240

Lys Glu Lys Ser Asn Val Asn Ile Asn Ile Glu Asn Val Lys Arg Trp
245 250 255




Glu Lys Gly Lys Lys Leu Thr Tyr Lys Ile Pro Lys Glu Asn Asp Asn
260 265 270

Ser Leu Gly Val Tyr Thr Ile Asn Trp Leu Cys Thr Glu Ile Asn Asn
275 280 285

Ile Glu Asp Ser Ile Gly Leu Glu Ile Leu Ser Glu Ile Leu Leu Asp
290 295 300

Asp Ser Cys Ser Phe Thr Ile Asn Ile Leu Lys Ser Gly Ile Gly Glu
305 310 315 320

Asp Ile Ala His Ile Ser Gly Ile Asn Thr Asp Leu Lys Glu Ser Ile
325 330 335

Phe Ser Phe Gly Leu Gln Asn Val Val Glu Asn Lys Glu Lys Glu Phe
340 345 350

Lys Asn Leu Val Phe Ser Glu Leu Lys Asn Leu Val Lys Asn Lys Ile
355 360 365

Pro Lys Glu Leu Ile Lys Gly Ile Leu Phe Gly Tyr Glu Phe Ala Leu
370 375 380

Lys Glu Glu Lys Gly Gln Asn Phe Pro Ile Ala Leu Met Ile Lys Ser
385 390 395 400

Phe Lys Gly Trp Leu Asn Gly Leu His Pro Ile Lys Thr Leu Gln Thr
405 410 415

Ser Tyr Tyr Ile Asn Glu Ile Thr Asn Lys Leu Glu Lys Gly Ile Tyr
420 425 430

Tyr Phe Glu Asn Leu Ile Glu Lys Tyr Leu Ile Phe Asn Asn His Tyr
435 440 445

Thr Leu Ile Ser Phe Ile Pro Ser His Asp Thr Glu Lys Glu Met Glu
450 455 460

Glu Glu Ile Glu Lys Lys Leu Met Ala Arg Glu Ile Glu Ile Lys Gln
465 470 475 480

Asn Pro Glu Glu Phe Leu Gln Phe Lys Lys Asp Tyr Asn Gln Phe Lys
485 490 495




Lys Tyr Gln Asn Lys Lys Asp Ser Lys Ala Asp Ile Ala Lys Leu Pro
500 505 510

Leu Leu Lys Ile Glu Asp Leu Pro Lys Gln Ile Glu Lys Ser Leu Asp
515 520 525

Leu Asn Glu Ile Lys Glu Leu Asn Leu His Ser Phe Lys Phe Lys Ser
530 535 540

Asn Asn Ile Phe Asn Val Asn Leu Phe Phe Lys Leu Asp Phe Leu Glu
545 550 555 560

Lys Glu Asp Tyr Ile Tyr Leu Ser Leu Phe Lys Arg Ala Leu Gln Asp
565 570 575

Leu Ser Thr Lys Asn Tyr Ser Tyr Ile Asn Ile Asn Asn Lys Ile Gln
580 585 590

Asn Thr Leu Gly Gln Ile Asn Ile Ser Glu Ser Tyr Asp Glu Asp Ile
595 600 605

Asp Gly Asn Ile Leu Asn Ser Phe Asn Ile Ser Phe Lys Ser Phe Asn
610 615 620

Asn Lys Val Lys Glu Ser Phe Glu Leu Ile Lys Glu Ile Leu Ile Asn
625 630 635 640

Ile Asn Phe His Asp Tyr Glu Arg Leu Lys Glu Ile Thr Leu Ser Leu
645 650 655

Lys Asn Asp Phe Lys Ser Leu Leu Ile Pro Lys Gly His Leu Leu Ala
660 665 670

Met Leu Arg Ser Lys Ser Lys Leu Lys Leu Asn Glu Tyr Leu Lys Glu
675 680 685

Leu Gln Asn Gly Ile Thr Gly Arg Glu Phe Trp Gln Lys Ala Lys Thr
690 695 700

Asp Thr Glu Ser Leu Lys Glu Ile Ala Asn Lys Leu Asp Asn Leu Lys
705 710 715 720

Asn Lys Ile Ile Leu Lys Asn Asn Leu Ser Ala Leu Ile Met Gly Asn
725 730 735




Thr Asp Asp Ile Leu Lys Asn Leu Glu Asn Glu Phe Phe Asn Leu Lys
740 745 750

Glu Ser Leu Glu Glu Ser Asn His Tyr Asn Gly Leu Leu Asn Leu Asp
755 760 765

Ala Asn Ser Lys Ala Leu Arg Glu Ile Ile Ile Ile Gln Ser Lys Val
770 775 780

Ala Phe Asn Ala Ile Cys Phe Pro Ser Tyr Lys Ile Asn Asp Glu Asn
785 790 795 800

Tyr Pro Lys Ala Asn Phe Leu Glu His Val Leu Arg Ser Gly Ile Phe
805 810 815

Trp Glu Lys Ile Arg Val Met Gly Gly Ala Tyr Gly Ala Ser Ala Ser
820 825 830

Ile Ala Asn Gly Ile Phe Ser Phe Ala Ser Tyr Arg Asp Pro Asn Phe
835 840 845

Thr Lys Thr Tyr Gln Ala Phe Glu Lys Ser Leu Glu Glu Leu Ala Asn
850 855 860

Asn Lys Met Thr Asp Asp Glu Ile Tyr Thr Tyr Leu Ile Gly Leu Ile
865 870 875 880

Gly Thr Asn Ile Tyr Val Lys Thr Lys Ala Thr Glu Ala Leu Gln Ser
885 890 895

Tyr Arg Arg Lys Met Leu Asn Ile Ser Asp Ser Leu Arg Gln Asp Ile
900 905 910

Arg Asn Ala Tyr Phe Thr Ile Thr Pro Gln Asp Ile Lys Glu Ile Ser
915 920 925

Thr Lys Ile Leu Thr Gln Ile Arg Gln His Asn Ser Ile Ala Ser Leu
930 935 940

Val Asn Asn Gln Ile Tyr Glu Glu Glu Lys Asn Asn Leu Glu Lys Leu
945 950 955 960

Ile Gly Lys Glu Tyr Ser Leu Lys Lys Ile Tyr
965 970




<210> 8 
<211> 995 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 8 

Met Ser Ala Ser Lys Leu Trp Ser Cys Thr Glu Thr Val Leu Asn Gly
1 5 10 15

Gly Ile Lys Leu Phe Leu Tyr Ser Ser Lys Asn Thr Lys Leu Arg Val
20 25 30

Ala Ile Gly Glu Val Pro Gly Pro Met Val His Gly Ala Val Ser Phe
35 40 45

Val Thr Glu Ala Asp Ser Asp Asp Gly Leu Pro His Thr Leu Glu His
50 55 60

Leu Val Phe Met Gly Ser Lys Lys Tyr Pro Phe Lys Gly Val Leu Asp
65 70 75 80

Val Ile Ala Asn Arg Cys Leu Ala Asp Gly Thr Asn Ala Trp Thr Asp
85 90 95

Thr Asp His Thr Ala Tyr Thr Leu Ser Thr Val Gly Ser Asp Gly Phe
100 105 110

Leu Lys Val Leu Pro Val Tyr Ile Asn His Leu Leu Thr Pro Met Leu
115 120 125

Thr Ala Ser Gln Phe Ala Thr Glu Val His His Ile Thr Gly Glu Gly
130 135 140

Asn Asp Ala Gly Val Val Tyr Ser Glu Met Gln Asp His Glu Ser Glu
145 150 155 160

Met Glu Ser Ile Met Asp Arg Lys Thr Lys Glu Val Ile Tyr Pro Pro
165 170 175

Phe Asn Pro Tyr Ala Val Asp Thr Gly Gly Arg Leu Lys Asn Leu Arg 
180 185 190 



Glu Ser Cys Thr Leu Glu Lys Val Arg Asp Tyr His Lys Lys Phe Tyr 
195 200 205 




His Leu Ser Asn Met Val Val Thr Val Cys Gly Met Val Asp His Asp 
210 215 220 



Gln Val Leu Glu Ile Met Asn Asn Val Glu Asn Glu His Met Ser Thr
225 230 235 240

Val Pro Asp His Phe Pro Lys Pro Phe Ser Phe Ala Leu Ser Asp Ile
245 250 255

Lys Glu Ser Thr Val His Arg Val Glu Cys Pro Thr Asp Asp Ala Ser
260 265 270

Arg Gly Ala Val Glu Val Ala Trp Phe Ala His Ser Pro Ser Asp Leu
275 280 285

Glu Thr His Ser Ser Leu His Val Leu Phe Asp Tyr Leu Ser Asn Thr
290 295 300

Ser Val Ala Pro Leu Gln Lys Asp Phe Ile Leu Leu Glu Asp Pro Leu
305 310 315 320

Ala Ser Ser Val Ser Phe His Ile Ala Glu Gly Val Arg Cys Asp Leu
325 330 335

Arg Leu Asn Phe Ala Gly Val Pro Val Glu Lys Leu Asp Glu Cys Ala
340 345 350

Pro Lys Phe Phe Asp Lys Thr Val Arg Glu His Leu Glu Glu Ala Asn
355 360 365

Phe Asp Met Glu Arg Met Gly Tyr Leu Ile Asp Gln Thr Ile Leu Asn
370 375 380

Glu Leu Val Lys Leu Glu Thr Asn Ala Pro Lys Asp Ile Met Ser His
385 390 395 400

Ile Ile Gly His Gln Leu Phe Asp Asn Glu Asp Glu Glu Leu Phe Lys
405 410 415

Lys Arg Thr Asn Glu Ile Asp Phe Leu Lys Lys Leu Lys Ser Glu Pro
420 425 430

Ala Ser Tyr Trp Val Gln Leu Val Asn Lys Tyr Phe Thr Ala Pro Ser
435 440 445




Ala Thr Val Ile Gly Val Pro Asn Glu Glu Leu Val Asp Lys Ile Ala
450 455 460

Glu Glu Glu Glu Lys Arg Ile Ala Ala Gln Cys Glu Lys Leu Gly Lys
465 470 475 480

Lys Gly Leu Glu Glu Ala Gly Lys Ser Leu Glu Ala Ala Ile Leu Glu
485 490 495

Asn Thr Ala Asn His Pro Ser Ala Glu Leu Leu Asp Gln Leu Ile Val
500 505 510

Lys Asp Leu Glu Ala Phe Asp Arg Phe Pro Val Gln Ser Leu Thr Ser
515 520 525

Asn Ser Pro Ser Leu Thr Pro Gln Gln Ser Thr Phe Leu Ala Gln Phe
530 535 540

Pro Phe His Ala Asn Leu His Asn Cys Pro Thr Lys Phe Val Glu Ile
545 550 555 560

Phe Phe Leu Leu Asp Ser Ser Asn Leu Ser Ile Glu Asp Arg Ser Tyr
565 570 575

Leu Phe Leu Tyr Thr Asp Leu Leu Phe Glu Ser Pro Ala Met Ile Asp
580 585 590

Gly Val Leu Thr Ser Ala Asp Asp Val Ala Lys His Phe Thr Lys Asp
595 600 605

Leu Ile Asp His Ser Ile Gln Val Gly Val Ser Gly Leu Tyr Asp Arg
610 615 620

Phe Val Asn Leu Arg Ile Lys Val Gly Ala Asp Lys Tyr Pro Leu Leu
625 630 635 640

Ala Lys Trp Ala Gln Ile Phe Thr Gln Gly Val Val Phe Asp Pro Ser
645 650 655

Arg Ile His Gln Cys Ala Gln Lys Leu Ala Gly Glu Ala Arg Asp Arg
660 665 670



Lys Arg Asp Gly Cys Thr Val Ala Ser Thr Ala Val Ala Ser Met Val 
675 680 685 




Tyr Gly Lys Asn Thr Asn Cys Ile Leu Phe Asp Glu Leu Val Leu Glu
690 695 700

Lys Leu His Glu Lys Ile Ser Lys Asp Val Met Lys Asn Pro Glu Ala
705 710 715 720

Val Leu Glu Lys Leu Glu Gln Val Arg Ser Ala Leu Phe Ser Asn Gly
725 730 735

Val Asn Ala His Phe Val Ala Asp Val Asp Ser Ile Asp Pro Lys Met
740 745 750

Leu Ser Ser Asp Leu Trp Thr Trp Val Gln Ala Asp Pro Arg Phe Gly
755 760 765

Pro Gly His Gln Phe Ser Ala Glu Ala Gly Glu Asn Val Ser Leu Glu
770 775 780

Leu Gly Lys Glu Leu Leu Ile Gly Val Gly Gly Ser Glu Ser Ser Phe
785 790 795 800

Ile Tyr Gln Thr Ser Phe Leu Asp Ala Asn Trp Asn Ser Glu Glu Leu
805 810 815

Ile Pro Ala Met Ile Phe Gly Gln Tyr Leu Ser Gln Cys Glu Gly Pro
820 825 830

Leu Trp Arg Ala Ile Arg Gly Asp Gly Leu Ala Tyr Gly Ala Asn Val
835 840 845

Phe Val Lys Pro Asp Arg Lys Gln Ile Thr Leu Ser Leu Tyr Arg Cys
850 855 860

Ala Gln Pro Ala Val Ala Tyr Glu Arg Thr Arg Asp Ile Ile Arg Lys
865 870 875 880

Ile Val Glu Ser Gly Glu Ile Ser Lys Ala Glu Phe Glu Gly Ala Lys
885 890 895

Arg Ser Thr Val Phe Glu Met Met Lys Arg Glu Gly Thr Val Ser Gly
900 905 910

Ala Ala Lys Ile Ser Ile Leu Asn Asn Phe Arg Gln Thr Pro His Pro
915 920 925




Phe Asn Ile Asp Leu Cys Arg Arg Ile Trp Asn Leu Thr Ser Glu Glu
930 935 940

Met Val Lys Ile Gly Gly Pro Pro Leu Ala Arg Leu Phe Asp Glu Lys
945 950 955 960

Cys Phe Val Arg Ser Ile Ala Val His Pro Ser Lys Leu Asn Glu Met
965 970 975

Lys Lys Ala Phe Pro Gly Ser Ser Lys Ile Lys Ile Ser Asp Leu Gln
980 985 990



Phe Ala Cys 
995 



<210> 9 
<211> 962 
<212> PRT 

<213> Escherichia coli 
<400> 9 

Met Pro Arg Ser Thr Trp Phe Lys Ala Leu Leu Leu Leu Val Ala Leu
1 5 10 15

Trp Ala Pro Leu Ser Gln Ala Glu Thr Gly Trp Gln Pro Ile Gln Glu
20 25 30

Thr Ile Arg Lys Ser Asp Lys Asp Asn Arg Gln Tyr Gln Ala Ile Arg
35 40 45

Leu Asp Asn Gly Met Val Val Leu Leu Val Ser Asp Pro Gln Ala Val
50 55 60

Lys Ser Leu Ser Ala Leu Val Val Pro Val Gly Ser Leu Glu Asp Pro
65 70 75 80

Glu Ala Tyr Gln Gly Leu Ala His Tyr Leu Glu His Met Ser Leu Met
85 90 95

Gly Ser Lys Lys Tyr Pro Gln Ala Asp Ser Leu Ala Glu Tyr Leu Lys
100 105 110



Met His Gly Gly Ser His Asn Ala Ser Thr Ala Pro Tyr Arg Thr Ala 
115 120 125 






Phe Tyr Leu Glu Val Glu Asn Asp Ala Leu Pro Gly Ala Val Asp 
130 135 140 



Leu Ala Asp Ala Ile Ala Glu Pro Leu Leu Asp Lys Lys Tyr Ala Glu
145 150 155 160

Arg Glu Arg Asn Ala Val Asn Ala Glu Leu Thr Met Ala Arg Thr Arg
165 170 175

Asp Gly Met Arg Met Ala Gln Val Ser Ala Glu Thr Ile Asn Pro Ala
180 185 190

His Pro Gly Ser Lys Phe Ser Gly Gly Asn Leu Glu Thr Leu Ser Asp
195 200 205

Lys Pro Gly Asn Pro Val Gln Gln Ala Leu Lys Asp Phe His Glu Lys
210 215 220

Tyr Tyr Ser Ala Asn Leu Met Lys Ala Val Ile Tyr Ser Asn Lys Pro
225 230 235 240

Leu Pro Glu Leu Ala Lys Met Ala Ala Asp Thr Phe Gly Arg Val Pro
245 250 255

Asn Lys Glu Ser Lys Lys Pro Glu Ile Thr Val Pro Val Val Thr Asp
260 265 270

Ala Gln Lys Gly Ile Ile Ile His Tyr Val Pro Ala Leu Pro Arg Lys
275 280 285

Val Leu Arg Val Glu Phe Arg Ile Asp Asn Asn Ser Ala Lys Phe Arg
290 295 300

Ser Lys Thr Asp Glu Leu Ile Thr Tyr Leu Ile Gly Asn Arg Ser Pro
305 310 315 320

Gly Thr Leu Ser Asp Trp Leu Gln Lys Gln Gly Leu Val Glu Gly Ile
325 330 335

Ser Ala Asn Ser Asp Pro Ile Val Asn Gly Asn Ser Gly Val Leu Ala
340 345 350

Ile Ser Ala Ser Leu Thr Asp Lys Gly Leu Ala Asn Arg Asp Gln Val
355 360 365




Val Ala Ala Ile Phe Ser Tyr Leu Asn Leu Leu Arg Glu Lys Gly Ile
370 375 380

Asp Lys Gln Tyr Phe Asp Glu Leu Ala Asn Val Leu Asp Ile Asp Phe
385 390 395 400

Arg Tyr Pro Ser Ile Thr Arg Asp Met Asp Tyr Val Glu Trp Leu Ala
405 410 415

Asp Thr Met Ile Arg Val Pro Val Glu His Thr Leu Asp Ala Val Asn
420 425 430

Ile Ala Asp Arg Tyr Asp Ala Lys Ala Val Lys Glu Arg Leu Ala Met
435 440 445

Met Thr Pro Gln Asn Ala Arg Ile Trp Tyr Ile Ser Pro Lys Glu Pro
450 455 460

His Asn Lys Thr Ala Tyr Phe Val Asp Ala Pro Tyr Gln Val Asp Lys
465 470 475 480

Ile Ser Ala Gln Thr Phe Ala Asp Trp Gln Lys Lys Ala Ala Asp Ile
485 490 495

Ala Leu Ser Leu Pro Glu Leu Asn Pro Tyr Ile Pro Asp Asp Phe Ser
500 505 510

Leu Ile Lys Ser Glu Lys Lys Tyr Asp His Pro Glu Leu Ile Val Asp
515 520 525

Glu Ser Asn Leu Arg Val Val Tyr Ala Pro Ser Arg Tyr Phe Ala Ser
530 535 540

Glu Pro Lys Ala Asp Val Ser Leu Ile Leu Arg Asn Pro Lys Ala Met
545 550 555 560

Asp Ser Ala Arg Asn Gln Val Met Phe Ala Leu Asn Asp Tyr Leu Ala
565 570 575

Gly Leu Ala Leu Asp Gln Leu Ser Asn Gln Ala Ser Val Gly Gly Ile
580 585 590



Ser Phe Ser Thr Asn Ala Asn Asn Gly Leu Met Val Asn Ala Asn Gly 
595 600 605 




Tyr Thr Gln Arg Leu Pro Gln Leu Phe Gln Ala Leu Leu Glu Gly Tyr
610 615 620

Phe Ser Tyr Thr Ala Thr Glu Asp Gln Leu Glu Gln Ala Lys Ser Trp
625 630 635 640

Tyr Asn Gln Met Met Asp Ser Ala Glu Lys Gly Lys Ala Phe Glu Gln
645 650 655

Ala Ile Met Pro Ala Gln Met Leu Ser Gln Val Pro Tyr Phe Ser Arg
660 665 670

Asp Glu Arg Arg Lys Ile Leu Pro Ser Ile Thr Leu Lys Glu Val Leu
675 680 685

Ala Tyr Arg Asp Ala Leu Lys Ser Gly Ala Arg Pro Glu Phe Met Val
690 695 700

Ile Gly Asn Met Thr Glu Ala Gln Ala Thr Thr Leu Ala Arg Asp Val
705 710 715 720

Gln Lys Gln Leu Gly Ala Asp Gly Ser Glu Trp Cys Arg Asn Lys Asp
725 730 735

Val Val Val Asp Lys Lys Gln Ser Val Ile Phe Glu Lys Ala Gly Asn
740 745 750

Ser Thr Asp Ser Ala Leu Ala Ala Val Phe Val Pro Thr Gly Tyr Asp
755 760 765

Glu Tyr Thr Ser Ser Ala Tyr Ser Ser Leu Leu Gly Gln Ile Val Gln
770 775 780

Pro Trp Phe Tyr Asn Gln Leu Arg Thr Glu Glu Gln Leu Gly Tyr Ala
785 790 795 800

Val Phe Ala Phe Pro Met Ser Val Gly Arg Gln Trp Gly Met Gly Phe
805 810 815

Leu Leu Gln Ser Asn Asp Lys Gln Pro Ser Phe Leu Trp Glu Arg Tyr
820 825 830



Lys Ala Phe Phe Pro Thr Ala Glu Ala Lys Leu Arg Ala Met Lys Pro 
835 840 845 




Asp Glu Phe Ala Gln Ile Gln Gln Ala Val Ile Thr Gln Met Leu Gln
850 855 860

Ala Pro Gln Thr Leu Gly Glu Glu Ala Ser Lys Leu Ser Lys Asp Phe
865 870 875 880

Asp Arg Gly Asn Met Arg Phe Asp Ser Arg Asp Lys Ile Val Ala Gln
885 890 895

Ile Lys Leu Leu Thr Pro Gln Lys Leu Ala Asp Phe Phe His Gln Ala
900 905 910

Val Val Glu Pro Gln Gly Met Ala Ile Leu Ser Gln Ile Ser Gly Ser
915 920 925

Gln Asn Gly Lys Ala Glu Tyr Val His Pro Glu Gly Trp Lys Val Trp
930 935 940

Glu Asn Val Ser Ala Leu Gln Gln Thr Met Pro Leu Met Ser Glu Lys
945 950 955 960



Asn Glu 



<210> 10 

<211> 1161 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met Leu Arg Arg Val Ala Val Ala Ala Val Phe Ala Thr Gly Arg Lys
1 5 10 15

Leu Arg Cys Glu Ala Gly Arg Asp Val Thr Ala Val Gly Arg Ile Glu
20 25 30

Ala Arg Gly Leu Cys Glu Glu Ser Ala Lys Pro Phe Pro Thr Leu Thr
35 40 45

Met Pro Gly Arg Asn Lys Ala Lys Ser Thr Cys Ser Cys Pro Asp Leu
50 55 60

Gln Pro Asn Gly Gln Asp Leu Gly Glu Ser Gly Arg Val Ala Arg Leu
65 70 75 80




Gly Ala Asp Glu Ser Glu Glu Glu Gly Arg Ser Leu Ser Asn Val Gly 
85 90 95 



Asp Pro Glu Ile Ile Lys Ser Pro Ser Asp Pro Lys Gln Tyr Arg Tyr
100 105 110

Ile Lys Leu Gln Asn Gly Leu Gln Ala Leu Leu Ile Ser Asp Leu Ser
115 120 125

Asn Val Glu Gly Lys Thr Gly Asn Ala Thr Asp Glu Glu Glu Glu Glu
130 135 140

Glu Glu Glu Glu Glu Glu Gly Glu Glu Glu Glu Glu Glu Glu Glu Asp
145 150 155 160

Asp Asp Asp Asp Asp Asp Glu Asp Ser Gly Ala Glu Ile Gln Asp Asp
165 170 175

Asp Glu Glu Gly Phe Asp Asp Glu Glu Glu Phe Asp Asp Asp Glu His
180 185 190

Asp Asp Asp Asp Leu Asp Asn Glu Glu Asn Glu Leu Glu Glu Leu Glu
195 200 205

Glu Arg Val Glu Ala Arg Lys Lys Thr Thr Glu Lys Gln Ser Ala Ala
210 215 220

Ala Leu Cys Val Gly Val Gly Ser Phe Ala Asp Pro Asp Asp Leu Pro
225 230 235 240

Gly Leu Ala His Phe Leu Glu His Met Val Phe Met Gly Ser Leu Lys
245 250 255

Tyr Pro Asp Glu Asn Gly Phe Asp Ala Phe Leu Lys Lys His Gly Gly
260 265 270

Ser Asp Asn Ala Ser Thr Asp Cys Glu Arg Thr Val Phe Gln Phe Asp
275 280 285

Val Gln Arg Lys Tyr Phe Lys Glu Ala Leu Asp Arg Trp Ala Gln Phe
290 295 300

Phe Ile His Pro Leu Met Ile Arg Asp Ala Ile Asp Arg Glu Val Glu
305 310 315 320




Ala Val Asp Ser Glu Tyr Gln Leu Ala Arg Pro Ser Asp Ala Asn Arg
325 330 335

Lys Glu Met Leu Phe Gly Ser Leu Ala Arg Pro Gly His Pro Met Gly
340 345 350

Lys Phe Phe Trp Gly Asn Ala Glu Thr Leu Lys His Glu Pro Lys Lys
355 360 365

Asn Asn Ile Asp Thr His Ala Arg Leu Arg Glu Phe Trp Met Arg Tyr
370 375 380

Tyr Ser Ala His Tyr Met Thr Leu Val Val Gln Ser Lys Glu Thr Leu
385 390 395 400

Asp Thr Leu Glu Lys Trp Val Thr Glu Ile Phe Ser Gln Ile Pro Asn
405 410 415

Asn Gly Leu Pro Lys Pro Asn Phe Ser His Leu Thr Asp Pro Phe Asp
420 425 430

Thr Pro Ala Phe Asn Lys Leu Tyr Arg Val Val Pro Ile Arg Lys Ile
435 440 445

His Ala Leu Thr Ile Thr Trp Ala Leu Pro Pro Gln Gln Gln His Tyr
450 455 460

Arg Val Lys Pro Leu His Tyr Ile Ser Trp Leu Val Gly His Glu Gly
465 470 475 480

Lys Gly Ser Ile Leu Ser Tyr Leu Arg Lys Lys Cys Trp Ala Leu Ala
485 490 495

Leu Phe Gly Gly Asn Gly Glu Thr Gly Phe Glu Gln Asn Ser Thr Tyr
500 505 510

Ser Val Phe Ser Ile Ser Ile Thr Leu Thr Asp Glu Gly Tyr Glu His
515 520 525

Phe Tyr Glu Val Ala His Thr Val Phe Gln Tyr Leu Lys Met Leu Gln
530 535 540

Lys Leu Gly Pro Glu Lys Arg Val Phe Glu Glu Ile Gln Lys Ile Glu
545 550 555 560




Asp Asn Glu Phe His Tyr Gln Glu Gln Thr Asp Pro Val Glu Tyr Val
565 570 575

Glu Asn Met Cys Glu Asn Met Gln Leu Tyr Pro Arg Gln Asp Phe Leu
580 585 590

Thr Gly Asp Gln Leu Leu Phe Glu Tyr Lys Pro Glu Val Ile Ala Glu
595 600 605

Ala Leu Asn Gln Leu Val Pro Gln Lys Ala Asn Leu Val Leu Leu Ser
610 615 620

Gly Ala Asn Glu Gly Arg Cys Asp Leu Lys Glu Lys Trp Phe Gly Thr
625 630 635 640

Gln Tyr Ser Ile Glu Asp Ile Glu Asn Ser Trp Thr Glu Leu Trp Lys
645 650 655

Ser Asn Phe Asp Leu Asn Ser Asp Leu His Leu Pro Ala Glu Asn Lys
660 665 670

Tyr Ile Ala Thr Asp Phe Thr Leu Lys Ala Phe Asp Cys Pro Glu Thr
675 680 685

Glu Tyr Pro Ala Lys Ile Val Asn Thr Pro Gln Gly Cys Leu Trp Tyr
690 695 700

Lys Lys Asp Asn Lys Phe Lys Ile Pro Lys Ala Tyr Ile Arg Phe His
705 710 715 720

Leu Ile Ser Pro Leu Ile Gln Lys Ser Ala Ala Asn Val Val Leu Phe
725 730 735

Asp Ile Phe Val Asn Ile Leu Thr His Asn Leu Ala Glu Pro Ala Tyr
740 745 750

Glu Ala Asp Val Ala Gln Leu Glu Tyr Lys Leu Val Ala Gly Glu His
755 760 765

Gly Leu Ile Ile Arg Val Lys Gly Phe Asn His Lys Leu Pro Leu Leu
770 775 780

Phe Gln Leu Ile Ile Asp Tyr Leu Thr Glu Phe Ser Ser Thr Pro Ala
785 790 795 800




Val Phe Thr Met Ile Thr Glu Gln Leu Lys Lys Thr Tyr Phe Asn Ile
805 810 815

Leu Ile Lys Pro Glu Thr Leu Ala Lys Asp Val Arg Leu Leu Ile Leu
820 825 830

Glu Tyr Ser Arg Trp Ser Met Ile Asp Lys Tyr Arg Ala Leu Met Asp
835 840 845

Gly Leu Ser Leu Glu Ser Leu Leu Asn Phe Val Lys Asp Phe Lys Ser
850 855 860

Gln Leu Phe Val Glu Gly Leu Val Gln Gly Asn Val Thr Ser Thr Glu
865 870 875 880

Ser Met Asp Phe Leu Arg Tyr Val Val Asp Lys Leu Asn Phe Val Pro
885 890 895

Leu Glu Arg Glu Met Pro Val Gln Phe Gln Val Val Glu Leu Pro Ser
900 905 910

Gly His His Leu Cys Lys Val Arg Ala Leu Asn Lys Gly Asp Ala Asn
915 920 925

Ser Glu Val Thr Val Tyr Tyr Gln Ser Gly Thr Arg Ser Leu Arg Glu
930 935 940

Tyr Thr Leu Met Glu Leu Leu Val Met His Met Glu Glu Pro Cys Phe
945 950 955 960

Asp Phe Leu Arg Thr Lys Gln Thr Leu Gly Tyr His Val Tyr Pro Thr
965 970 975

Cys Arg Asn Thr Ser Gly Ile Leu Gly Phe Ser Val Thr Val Gly Thr
980 985 990

Gln Ala Thr Lys Tyr Asn Ser Glu Thr Val Asp Lys Lys Ile Glu Glu
995 1000 1005

Phe Leu Ser Ser Phe Glu Glu Lys Ile Glu Asn Leu Thr Glu Asp
1010 1015 1020

Ala Phe Asn Thr Gln Val Thr Ala Leu Ile Lys Leu Lys Glu Cys
1025 1030 1035




Glu Asp Thr His Leu Gly Glu Glu Val Asp Arg Asn Trp Asn Glu 
1040 1045 1050 



Val Val Thr Gln Gln Tyr Leu Phe Asp Arg Leu Ala His Glu Ile
1055 1060 1065

Glu Ala Leu Lys Ser Phe Ser Lys Ser Asp Leu Val Ser Trp Phe
1070 1075 1080

Lys Ala His Arg Gly Pro Gly Ser Lys Met Leu Ser Val His Val
1085 1090 1095

Val Gly Tyr Gly Lys Tyr Glu Leu Glu Glu Asp Gly Ala Pro Val
1100 1105 1110

Cys Glu Asp Pro Asn Ser Arg Glu Gly Met Gln Leu Ile Tyr Leu
1115 1120 1125

Pro Pro Ser Pro Leu Leu Ala Glu Ser Thr Thr Pro Ile Thr Asp
1130 1135 1140

Ile Arg Ala Phe Thr Ala Thr Leu Ser Leu Phe Pro Tyr His Lys
1145 1150 1155

Ile Val Lys
1160



<210> 11 

<211> 1019 

<212> PRT 

<213> Homo sapiens 

<400> 11 

Met Arg Tyr Arg Leu Ala Trp Leu Leu His Pro Ala Leu Pro Ser Thr
1 5 10 15

Phe Arg Ser Val Leu Gly Ala Arg Leu Pro Pro Pro Glu Arg Leu Cys
20 25 30

Gly Phe Gln Lys Lys Thr Tyr Ser Lys Met Asn Asn Pro Ala Ile Lys
35 40 45

Arg Ile Gly Asn His Ile Thr Lys Ser Pro Glu Asp Lys Arg Glu Tyr
50 55 60
Arg Gly Leu Glu Leu Ala Asn Gly Ile Lys Val Leu Leu Met Ser Asp
65 70 75 80

Pro Thr Thr Asp Lys Ser Ser Ala Ala Leu Asp Val His Ile Gly Ser
85 90 95

Leu Ser Asp Pro Pro Asn Ile Ala Gly Leu Ser His Phe Cys Glu His
100 105 110

Met Leu Phe Leu Gly Thr Lys Lys Tyr Pro Lys Glu Asn Glu Tyr Ser
115 120 125

Gln Phe Leu Ser Glu His Ala Gly Ser Ser Asn Ala Phe Thr Ser Gly
130 135 140

Glu His Thr Asn Tyr Tyr Phe Asp Val Ser His Glu His Leu Glu Gly
145 150 155 160

Ala Leu Asp Arg Phe Ala Gln Phe Phe Leu Cys Pro Leu Phe Asp Glu
165 170 175

Ser Cys Lys Asp Arg Glu Val Asn Ala Val Asp Ser Glu His Glu Lys
180 185 190

Asn Val Met Asn Asp Ala Trp Arg Leu Phe Gln Leu Glu Lys Ala Thr
195 200 205

Gly Asn Pro Lys His Pro Phe Ser Lys Phe Gly Thr Gly Asn Lys Tyr
210 215 220

Thr Leu Glu Thr Arg Pro Asn Gln Glu Gly Ile Asp Val Arg Gln Glu
225 230 235 240

Leu Leu Lys Phe His Ser Ala Tyr Tyr Ser Ser Asn Leu Met Ala Val
245 250 255

Cys Val Leu Gly Arg Glu Ser Leu Asp Asp Leu Thr Asn Leu Val Val
260 265 270

Lys Leu Phe Ser Glu Val Glu Asn Lys Asn Val Pro Leu Pro Glu Phe
275 280 285

Pro Glu His Pro Phe Gln Glu Glu His Leu Lys Gln Leu Tyr Lys Ile
290 295 300
Val Pro Ile Lys Asp Ile Arg Asn Leu Tyr Val Thr Phe Pro Ile Pro
305 310 315 320

Asp Leu Gln Lys Tyr Tyr Lys Ser Asn Pro Gly His Tyr Leu Gly His
325 330 335

Leu Ile Gly His Glu Gly Pro Gly Ser Leu Leu Ser Glu Leu Lys Ser
340 345 350

Lys Gly Trp Val Asn Thr Leu Val Gly Gly Gln Lys Glu Gly Ala Arg
355 360 365

Gly Phe Met Phe Phe Ile Ile Asn Val Asp Leu Thr Glu Glu Gly Leu
370 375 380

Leu His Val Glu Asp Ile Ile Leu His Met Phe Gln Tyr Ile Gln Lys
385 390 395 400

Leu Arg Ala Glu Gly Pro Gln Glu Trp Val Phe Gln Glu Cys Lys Asp
405 410 415

Leu Asn Ala Val Ala Phe Arg Phe Lys Asp Lys Glu Arg Pro Arg Gly
420 425 430

Tyr Thr Ser Lys Ile Ala Gly Ile Leu His Tyr Tyr Pro Leu Glu Glu
435 440 445

Val Leu Thr Ala Glu Tyr Leu Leu Glu Glu Phe Arg Pro Asp Leu Ile
450 455 460

Glu Met Val Leu Asp Lys Leu Arg Pro Glu Asn Val Arg Val Ala Ile
465 470 475 480

Val Ser Lys Ser Phe Glu Gly Lys Thr Asp Arg Thr Glu Glu Trp Tyr
485 490 495

Gly Thr Gln Tyr Lys Gln Glu Ala Ile Pro Asp Glu Val Ile Lys Lys
500 505 510

Trp Gln Asn Ala Asp Leu Asn Gly Lys Phe Lys Leu Pro Thr Lys Asn
515 520 525

Glu Phe Ile Pro Thr Asn Phe Glu Ile Leu Pro Leu Glu Lys Glu Ala
530 535 540
Thr Pro Tyr Pro Ala Leu Ile Lys Asp Thr Val Met Ser Lys Leu Trp
545 550 555 560

Phe Lys Gln Asp Asp Lys Lys Lys Lys Pro Lys Ala Cys Leu Asn Phe
565 570 575

Glu Phe Phe Ser Pro Phe Ala Tyr Val Asp Pro Leu His Cys Asn Met
580 585 590

Ala Tyr Leu Tyr Leu Glu Leu Leu Lys Asp Ser Leu Asn Glu Tyr Ala
595 600 605

Tyr Ala Ala Glu Leu Ala Gly Leu Ser Tyr Asp Leu Gln Asn Thr Ile
610 615 620

Tyr Gly Met Tyr Leu Ser Val Lys Gly Tyr Asn Asp Lys Gln Pro Ile
625 630 635 640

Leu Leu Lys Lys Ile Ile Glu Lys Met Ala Thr Phe Glu Ile Asp Glu
645 650 655

Lys Arg Phe Glu Ile Ile Lys Glu Ala Tyr Met Arg Ser Leu Asn Asn
660 665 670

Phe Arg Ala Glu Gln Pro His Gln His Ala Met Tyr Tyr Leu Arg Leu
675 680 685

Leu Met Thr Glu Val Ala Trp Thr Lys Asp Glu Leu Lys Glu Ala Leu
690 695 700

Asp Asp Val Thr Leu Pro Arg Leu Lys Ala Phe Ile Pro Gln Leu Leu
705 710 715 720

Ser Arg Leu His Ile Glu Ala Leu Leu His Gly Asn Ile Thr Lys Gln
725 730 735

Ala Ala Leu Gly Ile Met Gln Met Val Glu Asp Thr Leu Ile Glu His
740 745 750

Ala His Thr Lys Pro Leu Leu Pro Ser Gln Leu Val Arg Tyr Arg Glu
755 760 765

Val Gln Leu Pro Asp Arg Gly Trp Phe Val Tyr Gln Gln Arg Asn Glu
770 775 780
Val His Asn Asn Cys Gly Ile Glu Ile Tyr Tyr Gln Thr Asp Met Gln
785 790 795 800

Ser Thr Ser Glu Asn Met Phe Leu Glu Leu Phe Cys Gln Ile Ile Ser
805 810 815

Glu Pro Cys Phe Asn Thr Leu Arg Thr Lys Glu Gln Leu Gly Tyr Ile
820 825 830

Val Phe Ser Gly Pro Arg Arg Ala Asn Gly Ile Gln Ser Leu Arg Phe
835 840 845

Ile Ile Gln Ser Glu Lys Pro Pro His Tyr Leu Glu Ser Arg Val Glu
850 855 860

Ala Phe Leu Ile Thr Met Glu Lys Ser Ile Glu Asp Met Thr Glu Glu
865 870 875 880

Ala Phe Gln Lys His Ile Gln Ala Leu Ala Ile Arg Arg Leu Asp Lys
885 890 895

Pro Lys Lys Leu Ser Ala Glu Cys Ala Lys Tyr Trp Gly Glu Ile Ile
900 905 910

Ser Gln Gln Tyr Asn Phe Asp Arg Asp Asn Thr Glu Val Ala Tyr Leu
915 920 925

Lys Thr Leu Thr Lys Glu Asp Ile Ile Lys Phe Tyr Lys Glu Met Leu
930 935 940

Ala Val Asp Ala Pro Arg Arg His Lys Val Ser Val His Val Leu Ala
945 950 955 960

Arg Glu Met Asp Ser Cys Pro Val Val Gly Glu Phe Pro Cys Gln Asn
965 970 975

Asp Ile Asn Leu Ser Gln Ala Pro Ala Leu Pro Gln Pro Glu Val Ile
980 985 990

Gln Asn Met Thr Glu Phe Lys Arg Gly Leu Pro Leu Phe Pro Leu Val
995 1000 1005

Lys Pro His Ile Asn Phe Met Ala Ala Lys Leu
1010 1015
<210> 12
<211> 1265 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 12 

Met Ala Ser Ser Ser Ser Ser Ile Phe Thr Gly Val Lys Phe Ser Pro
1 5 10 15

Ile Leu Ala Pro Phe Asn Ser Gly Asp Ser Arg Arg Ser Arg Tyr Leu
20 25 30

Lys Asp Ser Arg Asn Lys Val Arg Phe Asn Pro Ser Ser Pro Arg Leu
35 40 45

Thr Pro His Arg Val Arg Val Glu Ala Pro Ser Leu Ile Pro Tyr Asn
50 55 60

Gly Leu Trp Ala Ala Gln Pro Asn Ser His Lys Gly Arg Leu Lys Arg
65 70 75 80

Asn Ile Val Ser Gly Lys Glu Ala Thr Gly Ile Ser Leu Ser Gln Gly
85 90 95

Arg Asn Phe Cys Leu Thr Cys Lys Arg Asn Gln Ala Gly Ile Arg Arg
100 105 110

Ala Leu Pro Ser Ala Phe Val Asp Arg Thr Ala Phe Ser Leu Ser Arg
115 120 125

Ser Ser Leu Thr Ser Ser Leu Arg Lys His Ser Gln Ile Val Asn Ala
130 135 140

Thr Leu Gly Pro Asp Glu Pro His Ala Ala Gly Thr Ala Trp Pro Asp
145 150 155 160

Gly Ile Val Ala Glu Arg Gln Asp Leu Asp Leu Leu Pro Pro Glu Ile
165 170 175

Asp Ser Ala Glu Leu Glu Ala Phe Leu Gly Cys Glu Leu Pro Ser His
180 185 190

Pro Lys Leu His Arg Gly Gln Leu Lys Asn Gly Leu Arg Tyr Leu Ile
195 200 205




Leu Pro Asn Lys Val Pro Pro Asn Arg Phe Glu Ala His Met Glu Val 
210 215 220 

His Val Gly Ser Ile Asp Glu Glu Glu Asp Glu Gln Gly Ile Ala His
225 230 235 240

Met Ile Glu His Val Ala Phe Leu Gly Ser Lys Lys Arg Glu Lys Leu
245 250 255

Leu Gly Thr Gly Ala Arg Ser Asn Ala Tyr Thr Asp Phe His His Thr
260 265 270

Val Phe His Ile His Ser Pro Thr His Thr Lys Asp Ser Glu Asp Asp
275 280 285

Leu Phe Pro Ser Val Leu Asp Ala Leu Asn Glu Ile Ala Phe His Pro
290 295 300

Lys Phe Leu Ser Ser Arg Val Glu Lys Glu Arg Arg Ala Ile Leu Ser
305 310 315 320

Glu Leu Gln Met Met Asn Thr Ile Glu Tyr Arg Val Asp Cys Gln Leu
325 330 335

Leu Gln His Leu His Ser Glu Asn Lys Leu Gly Arg Arg Phe Pro Ile
340 345 350

Gly Leu Glu Glu Gln Ile Lys Lys Trp Asp Val Asp Lys Ile Arg Lys
355 360 365

Phe His Glu Arg Trp Tyr Phe Pro Ala Asn Ala Thr Leu Tyr Ile Val
370 375 380

Gly Asp Ile Asp Asn Ile Pro Arg Ile Val His Asn Ile Glu Ala Val
385 390 395 400

Phe Gly Lys Asn Gly Leu Asp Asn Glu Ser Thr Pro Ser Ser Pro Ser 
405 410 415 

Pro Gly Ala Phe Gly Ala Met Ala Asn Phe Leu Val Pro Lys Leu Pro 
420 425 430 



Ala Gly Leu Gly Gly Thr Phe Ser Asn Glu Lys Thr Asn Thr Ala Asp 
435 440 445 

Gln Ser Lys Met Ile Lys Arg Glu Arg His Ala Ile Arg Pro Pro Val
450 455 460

Glu His Asn Trp Ser Leu Pro Gly Thr Ser Val Asp Leu Lys Pro Pro
465 470 475 480

Gln Ile Phe Lys His Glu Leu Leu Gln Asn Phe Ala Ile Asn Met Phe
485 490 495

Cys Lys Ile Pro Val Ser Lys Val Gln Thr Phe Gly Asp Leu Arg Asn
500 505 510

Val Leu Met Lys Arg Ile Phe Leu Ser Ala Leu His Phe Arg Ile Asn
515 520 525

Thr Arg Tyr Lys Ser Ser Asn Pro Pro Phe Thr Ser Val Glu Leu Asp
530 535 540

His Ser Asp Ser Gly Arg Glu Gly Cys Thr Val Thr Thr Leu Thr Val
545 550 555 560

Thr Ala Glu Pro Gln Asn Trp Gln Asn Ala Val Lys Val Ala Val Gln
565 570 575

Glu Val Arg Arg Leu Lys Glu Phe Gly Val Thr Arg Gly Glu Leu Thr
580 585 590

Arg Tyr Met Asp Ala Leu Leu Lys Asp Ser Glu His Leu Ala Ala Met
595 600 605

Ile Asp Asn Val Ser Ser Val Asp Asn Leu Asp Phe Ile Met Glu Ser
610 615 620

Asp Ala Leu Ser His Thr Val Met Asp Gln Thr Gln Gly His Glu Thr
625 630 635 640

Leu Val Ala Val Ala Gly Thr Val Thr Leu Glu Glu Val Asn Thr Val
645 650 655

Gly Ala Lys Val Leu Glu Phe Ile Ser Asp Phe Gly Arg Pro Thr Ala
660 665 670

Leu Leu Pro Ala Ala Ile Val Ala Cys Val Pro Thr Lys Val His Val
675 680 685

Asp Gly Val Gly Glu Ser Asp Phe Asn Ile Ser Pro Asp Glu Ile Ile
690 695 700

Glu Ser Val Lys Ser Gly Leu Leu Ala Pro Ile Glu Ala Glu Pro Glu
705 710 715 720

Leu Glu Val Pro Lys Glu Leu Ile Ser Gln Ser Gln Leu Lys Glu Leu
725 730 735

Thr Leu Gln Arg Asn Pro Cys Phe Val Pro Ile Pro Gly Ser Gly Leu
740 745 750

Thr Lys Leu His Asp Lys Glu Thr Gly Ile Thr Gln Leu Arg Leu Ser
755 760 765

Asn Gly Ile Ala Val Asn Tyr Lys Lys Ser Thr Thr Glu Ser Arg Ala
770 775 780

Gly Val Met Arg Leu Ile Val Gly Gly Gly Arg Ala Ala Glu Thr Ser
785 790 795 800

Asp Ser Lys Gly Ala Val Val Val Gly Val Arg Thr Leu Ser Glu Gly
805 810 815

Gly Arg Val Gly Asp Phe Ser Arg Glu Gln Val Glu Leu Phe Cys Val
820 825 830

Asn His Leu Ile Asn Cys Ser Leu Glu Ser Thr Glu Glu Phe Ile Ala
835 840 845

Met Glu Phe Arg Phe Thr Leu Arg Asp Asn Gly Met Gln Ala Ala Phe
850 855 860

Gln Leu Leu His Met Val Leu Glu Arg Ser Val Trp Leu Glu Asp Ala
865 870 875 880

Phe Asp Arg Ala Arg Gln Leu Tyr Leu Ser Tyr Phe Arg Ser Ile Pro
885 890 895

Lys Ser Leu Glu Arg Ala Thr Ala His Lys Leu Met Ile Ala Met Leu
900 905 910

Asn Gly Asp Glu Arg Phe Val Glu Pro Thr Pro Lys Ser Leu Gln Ser
915 920 925

Leu Asn Leu Glu Ser Val Lys Asp Ala Val Met Ser His Phe Val Gly
930 935 940

Asp Asn Met Glu Val Ser Ile Val Gly Asp Phe Ser Glu Glu Glu Ile
945 950 955 960

Glu Arg Cys Ile Leu Asp Tyr Leu Gly Thr Val Lys Ala Ser His Asp
965 970 975

Ser Ala Lys Pro Pro Gly Ser Glu Pro Ile Leu Phe Arg Gln Pro Thr
980 985 990

Ala Gly Leu Gln Phe Gln Gln Val Phe Leu Lys Asp Thr Asp Glu Arg
995 1000 1005

Ala Cys Ala Tyr Ile Ala Gly Pro Ala Pro Asn Arg Trp Gly Phe
1010 1015 1020

Thr Val Asp Gly Asp Asp Leu Phe Gln Ser Val Ser Lys Leu Pro
1025 1030 1035

Val Ala His Asp Gly Leu Leu Lys Ser Glu Glu Gln Leu Leu Glu
1040 1045 1050

Gly Gly Asp Arg Glu Leu Gln Lys Lys Leu Arg Ala His Pro Leu
1055 1060 1065

Phe Phe Gly Val Thr Met Gly Leu Leu Ala Glu Ile Ile Asn Ser
1070 1075 1080

Arg Leu Phe Thr Thr Val Arg Asp Ser Leu Gly Leu Thr Tyr Asp
1085 1090 1095

Val Ser Phe Glu Leu Asn Leu Phe Asp Arg Leu Lys Leu Gly Trp
1100 1105 1110

Tyr Val Ile Ser Val Thr Ser Thr Pro Gly Lys Val Tyr Lys Ala
1115 1120 1125

Val Asp Ala Cys Lys Asn Val Leu Arg Gly Leu His Ser Asn Gln
1130 1135 1140

Ile Ala Pro Arg Glu Leu Asp Arg Ala Lys Arg Thr Leu Leu Met
1145 1150 1155

Arg His Glu Ala Glu Leu Lys Ser Asn Ala Tyr Trp Leu Asn Leu
1160 1165 1170

Leu Ala His Leu Gln Ala Ser Ser Val Gln Arg Lys Glu Leu Ser
1175 1180 1185

Cys Ile Lys Glu Leu Val Ser Leu Tyr Glu Ala Ala Ser Ile Glu
1190 1195 1200

Asp Ile Tyr Leu Ala Tyr Asn Gln Leu Arg Val Asp Glu Asp Ser
1205 1210 1215

Leu Tyr Ser Cys Ile Gly Ile Ala Gly Ala Gln Ala Gly Glu Glu
1220 1225 1230

Ile Thr Val Leu Ser Glu Glu Glu Glu Pro Glu Asp Val Phe Ser
1235 1240 1245

Gly Val Val Pro Val Gly Arg Gly Ser Ser Met Thr Thr Arg Pro
1250 1255 1260

Thr Thr
1265



<210> 13 

<211> 534 

<212> PRT 

<213> Homo sapiens 

<400> 13 

Met Arg Pro Asp Asp Lys Tyr His Glu Lys Gln Ala Gln Val Glu Ala
1 5 10 15

Thr Lys Leu Lys Gln Lys Val Glu Ala Leu Ser Pro Gly Asp Arg Gln
20 25 30

Gln Ile Tyr Glu Lys Gly Leu Glu Leu Arg Ser Gln Gln Ser Lys Pro
35 40 45

Gln Asp Ala Ser Cys Leu Pro Ala Leu Lys Val Ser Asp Ile Glu Pro
50 55 60

Thr Ile Pro Val Thr Glu Leu Asp Val Val Leu Thr Ala Gly Asp Ile
65 70 75 80

Pro Val Gln Tyr Cys Ala Gln Pro Thr Asn Gly Met Val Tyr Phe Arg
85 90 95

Ala Phe Ser Ser Leu Asn Thr Leu Pro Glu Glu Leu Arg Pro Tyr Val
100 105 110

Pro Leu Phe Cys Ser Val Leu Thr Lys Leu Gly Cys Gly Leu Leu Asp
115 120 125

Tyr Arg Glu Gln Ala Gln Gln Ile Glu Leu Lys Thr Gly Gly Met Ser
130 135 140

Ala Ser Pro His Val Leu Pro Asp Asp Ser His Met Asp Thr Tyr Glu
145 150 155 160

Gln Gly Val Leu Phe Ser Ser Leu Cys Leu Asp Arg Asn Leu Pro Asp
165 170 175

Met Met Gln Leu Trp Ser Glu Ile Phe Asn Asn Pro Cys Phe Glu Glu
180 185 190

Glu Glu His Phe Lys Val Leu Val Lys Met Thr Ala Gln Glu Leu Ala
195 200 205

Asn Gly Ile Pro Asp Ser Gly His Leu Tyr Ala Ser Ile Arg Ala Gly
210 215 220

Arg Thr Leu Thr Pro Ala Gly Asp Leu Gln Glu Thr Phe Ser Gly Met
225 230 235 240

Asp Gln Val Arg Leu Met Lys Arg Ile Ala Glu Met Thr Asp Ile Lys
245 250 255

Pro Ile Leu Arg Lys Leu Pro Arg Ile Lys Lys His Leu Leu Asn Gly
260 265 270

Asp Asn Met Arg Cys Ser Val Asn Ala Thr Pro Gln Gln Met Pro Gln
275 280 285

Thr Glu Lys Ala Val Glu Asp Phe Leu Arg Ser Ile Gly Arg Ser Lys
290 295 300

Lys Glu Arg Arg Pro Val Arg Pro His Thr Val Glu Lys Pro Val Pro
305 310 315 320

Ser Ser Ser Gly Gly Asp Ala His Val Pro His Gly Ser Gln Val Ile
325 330 335

Arg Lys Leu Val Met Glu Pro Thr Phe Lys Pro Trp Gln Met Lys Thr
340 345 350

His Phe Leu Met Pro Phe Pro Val Asn Tyr Val Gly Glu Cys Ile Arg
355 360 365

Thr Val Pro Tyr Thr Asp Pro Asp His Ala Ser Leu Lys Ile Leu Ala
370 375 380

Arg Leu Met Thr Ala Lys Phe Leu His Thr Glu Ile Arg Glu Lys Gly
385 390 395 400

Gly Ala Tyr Gly Gly Gly Ala Lys Leu Ser His Asn Gly Ile Phe Thr
405 410 415

Leu Tyr Ser Tyr Arg Asp Pro Asn Thr Ile Glu Thr Leu Gln Ser Phe
420 425 430

Gly Lys Ala Val Asp Trp Ala Lys Ser Gly Lys Phe Thr Gln Gln Asp
435 440 445

Ile Asp Glu Ala Lys Leu Ser Val Phe Ser Thr Val Asp Ala Pro Val
450 455 460

Ala Pro Ser Asp Lys Gly Met Asp His Phe Leu Tyr Gly Leu Ser Asp
465 470 475 480

Glu Met Lys Gln Ala His Arg Glu Gln Leu Phe Ala Val Ser His Asp
485 490 495

Lys Leu Leu Ala Val Ser Asp Arg Tyr Leu Gly Thr Gly Lys Ser Thr
500 505 510

His Gly Leu Ala Ile Leu Gly Pro Glu Asn Pro Lys Ile Ala Lys Asp
515 520 525

Pro Ser Trp Ile Ile Arg
530

<210> 14
<211> 409
<212> PRT
<213> Bacillus subtilis
<400> 14

Met Ile Lys Arg Tyr Thr Cys Pro Asn Gly Val Arg Ile Val Leu Glu
1 5 10 15

Asn Asn Pro Thr Val Arg Ser Val Ala Ile Gly Val Trp Ile Gly Thr
20 25 30

Gly Ser Arg His Glu Thr Pro Glu Ile Asn Gly Ile Ser His Phe Leu
35 40 45

Glu His Met Phe Phe Lys Gly Thr Ser Thr Lys Ser Ala Arg Glu Ile
50 55 60

Ala Glu Ser Phe Asp Arg Ile Gly Gly Gln Val Asn Ala Phe Thr Ser
65 70 75 80

Lys Glu Tyr Thr Cys Tyr Tyr Ala Lys Val Leu Asp Glu His Ala Asn
85 90 95

Tyr Ala Leu Asp Val Leu Ala Asp Met Phe Phe His Ser Thr Phe Asp
100 105 110

Glu Asn Glu Leu Lys Lys Glu Lys Asn Val Val Tyr Glu Glu Ile Lys
115 120 125

Met Tyr Glu Asp Ala Pro Asp Asp Ile Val His Asp Leu Leu Ser Lys
130 135 140

Ala Thr Tyr Gly Asn His Ser Leu Gly Tyr Pro Ile Leu Gly Thr Glu
145 150 155 160

Glu Thr Leu Ala Ser Phe Asn Gly Asp Ser Leu Arg Gln Tyr Met His
165 170 175

Asp Tyr Tyr Thr Pro Asp Arg Val Val Ile Ser Val Ala Gly Asn Ile
180 185 190

Ser Asp Ser Phe Ile Lys Asp Val Glu Lys Trp Phe Gly Ser Tyr Glu
195 200 205




Ala Lys Gly Lys Ala Thr Gly Leu Glu Lys Pro Glu Phe His Thr Glu
210 215 220

Lys Leu Thr Arg Lys Lys Glu Thr Glu Gln Ala His Leu Cys Leu Gly
225 230 235 240

Phe Lys Gly Leu Glu Val Gly His Glu Arg Ile Tyr Asp Leu Ile Val
245 250 255

Leu Asn Asn Val Leu Gly Gly Ser Met Ser Ser Arg Leu Phe Gln Asp
260 265 270

Val Arg Glu Asp Lys Gly Leu Ala Tyr Ser Val Tyr Ser Tyr His Ser
275 280 285

Ser Tyr Glu Asp Ser Gly Met Leu Thr Ile Tyr Gly Gly Thr Gly Ala
290 295 300

Asn Gln Leu Gln Gln Leu Ser Glu Thr Ile Gln Glu Thr Leu Ala Thr
305 310 315 320

Leu Lys Arg Asp Gly Ile Thr Ser Lys Glu Leu Glu Asn Ser Lys Glu
325 330 335

Gln Met Lys Gly Ser Leu Met Leu Ser Leu Glu Ser Thr Asn Ser Lys
340 345 350

Met Ser Arg Asn Gly Lys Asn Glu Leu Leu Leu Gly Lys His Lys Thr
355 360 365

Leu Asp Glu Ile Ile Asn Glu Leu Asn Ala Val Asn Leu Glu Arg Val
370 375 380

Asn Gly Leu Ala Arg Gln Leu Phe Thr Glu Asp Tyr Ala Leu Ala Leu
385 390 395 400

Ile Ser Pro Ser Gly Asn Met Pro Ser
405



<210> 15 

<211> 438 

<212> PRT 

<213> Mycobacterium tuberculosis 



<400> 15 

Met Pro Arg Arg Ser Pro Ala Asp Pro Ala Ala Ala Leu Ala Pro Arg
1 5 10 15

Arg Thr Thr Leu Pro Gly Gly Leu Arg Val Val Thr Glu Phe Leu Pro
20 25 30

Ala Val His Ser Ala Ser Val Gly Val Trp Val Gly Val Gly Ser Arg
35 40 45

Asp Glu Gly Ala Thr Val Ala Gly Ala Ala His Phe Leu Glu His Leu
50 55 60

Leu Phe Lys Ser Thr Pro Thr Arg Ser Ala Val Asp Ile Ala Gln Ala
65 70 75 80

Met Asp Ala Val Gly Gly Glu Leu Asn Ala Phe Thr Ala Lys Glu His
85 90 95

Thr Cys Tyr Tyr Ala His Val Leu Gly Ser Asp Leu Pro Leu Ala Val
100 105 110

Asp Leu Val Ala Asp Val Val Leu Asn Gly Arg Cys Ala Ala Asp Asp
115 120 125

Val Glu Val Glu Arg Asp Val Val Leu Glu Glu Ile Ala Met Arg Asp
130 135 140

Asp Asp Pro Glu Asp Ala Leu Ala Asp Met Phe Leu Ala Ala Leu Phe
145 150 155 160

Gly Asp His Pro Val Gly Arg Pro Val Ile Gly Ser Ala Gln Ser Val
165 170 175

Ser Val Met Thr Arg Ala Gln Leu Gln Ser Phe His Leu Arg Arg Tyr
180 185 190

Thr Pro Glu Arg Met Val Val Ala Ala Ala Gly Asn Val Asp His Asp
195 200 205

Gly Leu Val Ala Leu Val Arg Glu His Phe Gly Ser Arg Leu Val Arg
210 215 220

Gly Arg Arg Pro Val Ala Pro Arg Lys Gly Thr Gly Arg Val Asn Gly
225 230 235 240

Ser Pro Arg Leu Thr Leu Val Ser Arg Asp Ala Glu Gln Thr His Val
245 250 255

Ser Leu Gly Ile Arg Thr Pro Gly Arg Gly Trp Glu His Arg Trp Ala
260 265 270

Leu Ser Val Leu His Thr Ala Leu Gly Gly Gly Leu Ser Ser Arg Leu
275 280 285

Phe Gln Glu Val Arg Glu Thr Arg Gly Leu Ala Tyr Ser Val Tyr Ser
290 295 300

Ala Leu Asp Leu Phe Ala Asp Ser Gly Ala Leu Ser Val Tyr Ala Ala
305 310 315 320

Cys Leu Pro Glu Arg Phe Ala Asp Val Met Arg Val Thr Ala Asp Val
325 330 335

Leu Glu Ser Val Ala Arg Asp Gly Ile Thr Glu Ala Glu Cys Gly Ile
340 345 350

Ala Lys Gly Ser Leu Arg Gly Gly Leu Val Leu Gly Leu Glu Asp Ser
355 360 365

Ser Ser Arg Met Ser Arg Leu Gly Arg Ser Glu Leu Asn Tyr Gly Lys
370 375 380

His Arg Ser Ile Glu His Thr Leu Arg Gln Ile Glu Gln Val Thr Val
385 390 395 400

Glu Glu Val Asn Ala Val Ala Arg His Leu Leu Ser Arg Arg Tyr Gly
405 410 415

Ala Ala Val Leu Gly Pro His Gly Ser Lys Arg Ser Leu Pro Gln Gln
420 425 430

Leu Arg Ala Met Val Gly
435



<210> 16 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220>
<223> Oligonucleotide
<400> 16

aatagaagct tgtcgactga tctatccaaa actg 34



<210> 17 
<211> 66 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 17

aaaagagctc ggccagatct tctagaggat ccaagaattc tgttttatat ttgttgtaaa 60 
aagtag 66 



<210> 18
<211> 37
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 18

ttttgaattc caagatctcc catgtctcta ctggtgg 37

<210> 19
<211> 41
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 19

ccccgagctc gtcgaccctt ctcgaaagct ttaacgaacg c 41

<210> 20
<211> 43
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 20

ttttgaattc aaagaatgag atttccttca atttttactg cag 43

<210> 21
<211> 37
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 21

tttttctaga ctaggagggg tactcatact cctcggc 37

<210> 22
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 22

cgaatgtcca tcgttgcgaa cctgcagaac ctg 33

<210> 23
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 23

caggttctgc aggttcctaa cgatggacat tcg 33

<210> 24
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 24

cgaatgtcca tcgttaggaa cctgcagaac ctg 33

<210> 25
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 25

caggttctgc aggttcctaa cgatggacat tcg 33

<210> 26
<211> 18
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 26

tcgcagagaa cggatggc 18

<210> 27
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 27

ttttgggccc ttcatggtga tacggtatct cttggc 36

<210> 28
<211> 37
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 28

ttttctcgag aaggtggaac atactgccct gggatgg 37

<210> 29
<211> 38
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 29

ttttgagctc gtttaggaaa cgtccttggc ggagatgc 38

<210> 30
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 30

tttttctaga cactgcgaat ccatggtata aaccaaaacc 40

<210> 31
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 31

gtcgttgttc atggacatac ctcc 24

<210> 32
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 32

tacaaatgtt cttctgccat ttctgg 26

<210> 33
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 33

ggttcatatg cgccggagct cctcgacagc ag 32

<210> 34
<211> 37
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 34

ggttcctagg atccgcaagt ttgattccat tgcggtg 37

<210> 35
<211> 70
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 35

ttaaagagta ccttggctat agaataccgt agagataaag acctgaatag agattgtact 60
gagagtgcac 70



<210> 36
<211> 70
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 36

aggtattata actatttttc tgtatttttt atatattttt atttgccaag ctgtgcggta 60
tttcacaccg 70

<210> 37
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 37

ctttggttaa agagtacctt ggc 23

<210> 38
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 38

tactacgaaa agcgtgtgcg agg 23

<210> 39
<211> 71
<212> DNA
<213> Artificial Sequence
<220>
<223> Oligonucleotide
<400> 39

tagaaggcta ctcaaaagaa taaagttact ataaaatata ctgcggtata tagattgtac 60
tgagagtgca c 71

<210> 40
<211> 70 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 40 

gatcggcaag aaactttgaa gcagtatatt tacaggatta aattatatat ctgtgcggta 60 
tttcacaccg 70 

<210> 41 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 41 

cggaggggct ctatgataaa gg 22 

<210> 42 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 42 

gagtaactag ggcttctctt ccc 23 

<210> 43 

<211> 85 

<212> PRT 

<213> Homo sapiens 

<400> 43 

Ser Gly Leu Gln Arg Ala Glu Glu Ala Pro Arg Arg Gln Leu Arg Val
1 5 10 15

Ser Gln Arg Thr Asp Gly Glu Ser Arg Ala His Leu Gly Ala Leu Leu
20 25 30

Ala Arg Tyr Ile Gln Gln Ala Arg Lys Ala Pro Ser Gly Arg Met Ser
35 40 45

Ile Val Lys Asn Leu Gln Asn Leu Asp Pro Ser His Arg Ile Ser Asp
50 55 60

Arg Asp Tyr Met Gly Trp Met Asp Phe Gly Arg Arg Ser Ala Glu Glu
65 70 75 80

Tyr Glu Tyr Pro Ser
85



<210> 44 

<211> 22 

<212> PRT 

<213> Homo sapiens 



<400> 44 

Gln Leu Arg Val Ser Gln Arg Thr Asp Gly Glu Ser Arg Ala His Leu
1 5 10 15



Gly Ala Leu Leu Ala Arg 
20 



<210> 45 

<211> 19 

<212> PRT 

<213> Homo sapiens 



<400> 45 

Val Ser Gln Arg Thr Asp Gly Glu Ser Arg Ala His Leu Gly Ala Leu
1 5 10 15



Leu Ala Arg 



<210> 46 

<211> 51 

<212> PRT 

<213> Homo sapiens 



<400> 46 

Tyr Ile Gln Gln Ala Arg Lys Ala Pro Ser Gly Arg Met Ser Ile Val
1 5 10 15

Lys Asn Leu Gln Asn Leu Asp Pro Ser His Arg Ile Ser Asp Arg Asp
20 25 30

Tyr Met Gly Trp Met Asp Phe Gly Arg Arg Ser Ala Glu Glu Tyr Glu
35 40 45

Tyr Pro Ser
50

<210> 47 

<211> 17 

<212> PRT 

<213> Homo sapiens 



<400> 47 

Tyr Ile Gln Gln Ala Arg Lys Ala Pro Ser Gly Arg Met Ser Ile Val
1 5 10 15

Lys



<210> 48 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 48 

Tyr Ile Gln Gln Ala Arg Lys Ala Pro Ser Gly Arg Met Ser Ile Val
1 5 10 15 

<210> 49 

<211> 13 

<212> PRT 

<213> Homo sapiens 



<400> 49 

Asn Leu Gln Asn Leu Asp Pro Ser His Arg Ile Ser Asp
1 5 10



<210> 50 

<211> 23 

<212> PRT 

<213> Homo sapiens 



<400> 50 

Asn Leu Gln Asn Leu Asp Pro Ser His Arg Ile Ser Asp Arg Asp Tyr
1 5 10 15



Met Gly Trp Met Asp Phe Gly 
20 

<210> 51

<211> 34 

<212> PRT 

<213> Homo sapiens 

<400> 51 



Asn Leu Gln Asn Leu Asp Pro Ser His Arg Ile Ser Asp Arg Asp Tyr
1 5 10 15



Met Gly Trp Met Asp Phe Gly Arg Arg Ser Ala Glu Glu Tyr Glu Tyr 
20 25 30 



Pro Ser 



<210> 53 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Portion of fusion protein 
<400> 53 

Lys Arg Glu Ala Glu Ala Ser Gly Leu Gln Arg Ala
1 5 10



<210> 54 

<211> 9 

<212> PRT 

<213> Homo sapiens 



<400> 54 

Arg Met Ser Ile Val Lys Asn Leu Gln
1 5 



<210> 55
<211> 13
<212> PRT
<213> Homo sapiens
<400> 55

Asp Arg Asp Tyr Met Gly Trp Met Asp Phe Gly Arg Arg
1 5 10

Received PVS 20 SEP. 2002

[Figure 2]

[Figure 3]

[Figure 4: panel A]

[Figure 5]

[Figure 6]

[Figure 7: panel B; axis ticks 60 and 100]

[Figure 8: panels B and C; labels "Cells", "Media", "Total CCK (nM)", "Fraction CCK-22"]

[Figure 9]

[Figure 10: labels "Cells", "Media", "Fraction CCK-22"]

[Figure 11: panel B; axis ticks 50, 100 and 150]

[Sheet 12/13, figure label not legible in source (presumably Figure 12): panels A) and B); content not recoverable]

[Figure 13: schematic of two fusion constructs. Legible labels: "KREAEASGLQRA", "RMSIVK", "Cym1p activity", "DFG", "Gly-extended CCK-22", "preproMfα1p", "CCK(1-61)", "GLP1", "HAE", "KGR"]



